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5 NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO HELICOBA CTER 
PYLORI FOR DIAGNOSTICS AND THERAPEUTICS 

Background of the Invention 
Helicobacter pylori is a gram-negative, S-shaped, microaerophilic bacterium that 
1 0 was discovered and cultured from a human gastric biopsy specimen. (Warren, J.R. and B . 
Marshall, (1983) Lancet \: 1273-1275; and Marshall et al., (1984) Microbios Lett. 25: 83- 
88). H. pylori has been strongly linked to chronic gastritis and duodenal ulcer disease. 
(Rathbone et. al., (1986) Gut 27: 635-641 ). Moreover, evidence is accumulating for an 
etiologic role of//, pylori in nonulcer dyspepsia, gastric ulcer disease, and gastric 
15 adenocarcinoma. (Blaser M. J., (1993) Trends Microbiol. 1 : 255-260). Transmission of the 
bacteria occurs via the oral route, and the risk of infection increases with age. (Taylor, 
D.N. and M. J. Blaser, (1991) Epidemiol. Rev 13: 42-50). H. pylori colonizes the human 
gastric mucosa, establishing an infection that usually persists for decades. Infection by H. 
pylori is prevalent worldwide. Developed countries have infection rates over 50% of the 
20 adult population, while developing countries have infection rates reaching 90% of the 

adults over the age of 20. (Hopkins R. J. and J. G. Morris (1994) Am. J. Med. 97: 265-277). 

The bacterial factors necessary for colonization of the gastric environment, and for 
virulence of this pathogen, are poorly understood. Examples of the putative virulence 
factors include the following: urease, an enzyme that may play a role in neutralizing gastric 
25 acid pH (Eaton et al., (1991) Infect. Immunol. 59: 2470-2475; Ferrero, R.L. and A. Lee 
(1991) Microb. Ecol. Hlth. Dis. 4: 121-134; Labigne et al., (1991) J. Bacterial. 173 : 1920- 
1931): the bacterial flagellar proteins responsible for motility across the mucous layer. 
(Hazell et al., (1 986) J. Inf. Dis. 153: 658-663; Leying et al., (1992) Mol. Microbiol. 6: 
2863-2874; and Haas et al., (1993) Mol. Microbiol. 8: 753-760); Vac A, a bacterial toxin 
30 that induces the formation of intracellular vacuoles in epithelial cells (SchmitL W. and R. 
Haas, (1994) Molecular Microbiol. 12£2): 307-319); and several gastric tissue-specific 
adhesins. (Boren et al., (1993) Science 262: 1 892-1 895; Evans et al., (1 993) J. Bacterial. 
175 : 674-683; and Falk et al., (1993) Proc. Natl. Acad. Sci. USA 90: 2035-203). 

Numerous therapeutic agents are currently available that eradicate H. pylori 
35 infections in vitro. (Huesca et. al., (1993) Zbl. Bakt. 280: 244-252; Hopkins, R. J. and J. G. 
Morris, supra). However, many of these treatments are suboptimally effective in vivo 
because of bacterial resistance, altered drug distribution, patient non-compliance or poor 
drug availabilty. (Hopkins, R. J. and J. G. Morris, supra). Treatment with antibiotics 
combined with bismuth are part of the standard regime used to treat H. pylori infection. 
40 (Malfertheiner, P. and J. E. Dominguez-Munoz (1993) Clinical Therapeutics 15 Supp. B: 
37-48). Recently, combinations of proton pump inhibitors and a single antibiotic have been 
shown to ameliorate duodenal ulcer disease. (Malfertheiner, P. and J. E. Dominguez- 
Munoz supra). However, methods employing antibiotic agents can have the problem of the 
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5 emergence of bacterial strains which are resistant to these agents. (Hopkins, R. J. and J G 
Moms, supra). These limitations demonstrate that new more effective methods are needed 
to combat H. pylori infections in vivo. In particular, the design of new vaccines that may 
prevent infection by this bacterium is highly desirable. 

^ Summary of the Invention 

This invention relates to novel genes, e.g., genes encoding bacterial surface 
proteins, from the organism Helicobacter pylori, and other related genes, their products 
and uses thereof. The nucleic acids and peptides of the present invention have utility for 
diagnostic and therapeutics for H. pylori and other Helicobacter species. They can also be 
used to detect the presence of H. pylori and other Helicobacter species in a sample; and for 
use ,n screening compounds for the ability to interfere with the H. pylori life cycle or to 
inhibit H. pylori infection. More specifically, this invention features compositions of 
nucleic acids corresponding to entire coding sequences of H. pylori surface proteins or parts 
thereof, nucleic acids capable of binding mRNA from H. pylori surface proteins to block 
protein translation, and methods for producing H. pylori surface proteins or parts thereof 
using peptide synthesis and recombinant DNA techniques. This invention also includes 
antibodies and nucleic acids sequences useful as probes to detect H. pylori infection. In 
addition, vaccine compositions and methods for the protection against infection by H 
pylori are described. 
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Brief Desc ription of the Drawinp s 
Figure 1 is a table which contains information from homology searches performed 
on the sequences of this invention using the BLAST algorithm contained in the Wisconsin 
Sequence Analysis Package. 

Figure 2 is a bar graph that depicts the antibody titer in serum of mice following 
immunization with specific H. pylori antigens. 

Figure 3 is a bar graph that depicts the antibody titer in mucous of mice following 
immunization with specific H. pylori antigens. 

Figure 4 is a bar graph that depicts therapeutic immunization of//, pylori infected 
35 mice with specific antigens dissolved in HEPES buffer. 

Figure 5 is a bar graph that depicts therapeutic immunization of//, pylori infected 
mice with specific antigens dissolved in buffer containing DOC. 



40 



Detailed fWriptinn 0 f the Invention 
In one aspect, the invention features a substantially pure nucleic acid encoding an 
H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 15, such as a nucleic 
acid comprising a nucleotide sequence of SEQ ID NO:l . 
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5 In another aspect, the invention features a substantially pure nucleic acid encoding 

an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 16, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:2. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 1 7, such as a 
1 0 nucleic acid comprising a nucleotide sequence of SEQ ID NO:3 . 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an K pylori polypeptide having an amino acid sequence of SEQ ID NO:l 1 8, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:4. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
15 an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 1 9. such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:5. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 120, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:6. 
20 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 121, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:7. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 122, such as a 
25 nucleic acid comprising a nucleotide sequence of SEQ ID NO:8. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 123, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:9. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
30 an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 124, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 10. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 125, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 1 1 . 
35 In another aspect, the invention features a substantially pure nucleic acid encoding 

an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 126, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 12. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 127, such as a 
40 nucleic acid comprising a nucleotide sequence of SEQ ID NO: 1 3 
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In another aspect, the invention features a substantially pure nucleic acid encoding 
an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 128, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 14. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 129, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 15. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:130, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 16. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 131, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 17. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 32, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 18. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 133, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 19. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 34, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:20. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 35, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:21 . 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 136, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:22. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 37, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:23. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 138, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:24. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 139, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:25. 
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5 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:140, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:26. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:141, such as a 
1 0 nucleic acid comprising a nucleotide sequence of SEQ ID NO:27. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:142, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:28. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
15 an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:143, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:29. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:144, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:30. 
20 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 145, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:31. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 146, such as a 
25 nucleic acid comprising a nucleotide sequence of SEQ ID NO:32. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 47, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:33. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
30 an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 48, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:34. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:149, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:35. 
35 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:l 50, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:36. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 5 1 . such as a 
40 nucleic acid comprising a nucleotide sequence of SEQ ID NO:37. 
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5 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 52. such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:38. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 153. such as a 
10 nucleic acid comprising a nucleotide sequence of SEQ ID NO:39. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 54, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:40. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 55, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:41. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 56, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:42. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 57, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:43. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:l 58, such as a 
25 nucleic acid comprising a nucleotide sequence of SEQ ID NO:44. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 159, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:45. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 160. such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:46. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO : 1 6 1 , such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:47. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 162. such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:48. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 163, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:49. 



30 
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5 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:164, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:50. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 165, such as a 
1 0 nucleic acid comprising a nucleotide sequence of SEQ ID NO:5 1 . 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 166, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:52. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
15 an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 67. such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:53. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 168, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:54. 
20 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 69, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:55. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 70, such as a 
25 nucleic acid comprising a nucleotide sequence of SEQ ID NO: 56. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 71, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:57. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
30 an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 72, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:58. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 73, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:59. 
35 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 174, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:60. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:175, such as a 
40 nucleic acid comprising a nucleotide sequence of SEQ ID NO:61 . 
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5 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 76, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:62. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 177, such as a 
1 0 nucleic acid comprising a nucleotide sequence of SEQ ID NO:63. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 78, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:64. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
15 an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 79, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:65. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 180, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:66. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:181, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:67. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 182, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:68. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 183, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:69. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 84. such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:70. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 85, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:71 . 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 186, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:72. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 1 87, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:73. 
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5 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 188. such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:74. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 189, such as a 
] 0 nucleic acid comprising a nucleotide sequence of SEQ ID NO:75. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 190, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:76. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
15 an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 191, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:77. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 192, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:78. 
20 In another aspect, the invention features a substantially pure nucleic acid encoding 

an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 193, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:79. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 194, such as a 
25 nucleic acid comprising a nucleotide sequence of SEQ ID NO:80. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 195, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:81 

In another aspect, the invention features a substantially pure nucleic acid encoding 
30 an E pylori polypeptide having an amino acid sequence of SEQ ID NO:196 ? such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:82. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 197, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:83. 
35 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 198, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:84. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an K pylori polypeptide having an amino acid sequence of SEQ ID NO: 199, such as a 
40 nucleic acid comprising a nucleotide sequence of SEQ ID NO:85. 
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In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:200, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:86. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:201 , such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:87. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:202. such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:88. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:203, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:89. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:204, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:90. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:205, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:91 . 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:206, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:92. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:207, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:93. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:208, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:94. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:209, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:95. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:21 0, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:96. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:21 1, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:97. 
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5 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:212, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:98. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an E pylori polypeptide having an amino acid sequence of SEQ ID NO:213, such as a 
1 0 nucleic acid comprising a nucleotide sequence of SEQ ID NO:99. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:214, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 100. 

In another aspect , the invention features a substantially pure nucleic acid encoding 
15 an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:2 15, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 101. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an K pylori polypeptide having an amino acid sequence of SEQ ID NO:216, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 102. 
20 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:21 7, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 103. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:2 1 8, such as a 
25 nucleic acid comprising a nucleotide sequence of SEQ ID NO: 104. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:21 9, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 105. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
30 an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:220, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 106. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:221 , such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 107. 
35 In another aspect, the invention features a substantially pure nucleic acid encoding 

an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:222, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 108. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:223, such as a 
40 nucleic acid comprising a nucleotide sequence of SEQ ID NO: 1 09. 
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In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:224, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 11 0. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:225, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 1 1 1 . 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:226. such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 1 1 2. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:227, such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO:l 13. 

In another aspect, the invention features a substantially pure nucleic acid encoding 
an H. pylori polypeptide having an amino acid sequence of SEQ ID N0.228. such as a 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 114. 

In another aspect, the invention comprises nucleic acids capable of binding mRNA 
of//, pylori. Such nucleic acid is capable of acting as anti-sense nucleic acid to control the 
translation of mRNA of//, pylori. A further aspect features a nucleic acid which is capable 
of binding specifically to an H. pylori nucleic acid. Such nucleic acid has utility as probes 
and as capture reagents. 

In another aspect, the invention features an expression system comprising an open 
reading frame corresponding to H. pylori nucleic acid. The nucleic acid further comprises a 
control sequence compatible with an intended host. The expression system is useful for 
making polypeptides corresponding to //. pylori nucleic acid. 

In another aspect, the invention features a cell transformed with the expression 
system to make H. pylori polypeptides. 

In another aspect, the invention features a method of generating antibodies against 
H. pylori proteins which are capable of binding specifically to H. pylori proteins. Such 
antibody has utility as reagents for immunoassays to evaluate the abundance and 
distribution of//. /?y/or/-specific antigens. 

In another aspect, the invention features a method of generating vaccines for 
immunizing an individual against H. pylori. The method includes: immunizing a subject 
with an H. pylori protein, e.g., a surface protein, or portion thereof, and a pharmaceutically 
acceptable carrier. Such vaccines have therapeutic and prophylactic utilities. 

In another aspect, the invention provides a method for generating a vaccine 
comprising a modified immunogenic H. pylori protein, e.g., a surface protein, or portion 
thereof, and a pharmacologically acceptable carrier. 
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5 In another aspect, the invention features a method of evaluating a compound, e.g. a 

polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an H. pylori 
polypeptide. The method includes: contacting the candidate compound with an H. pylori 
polypeptides and determining if the compound binds or otherwise interacts with an K 
pylori polypeptide. Compounds which bind H. pylori are candidates as activators or 

10 inhibitors of the bacterial life cycle. These assays can be performed in vitro or in vivo. 

In another aspect, the invention features a method of evaluating a compound, e.g. a 
polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an K pylori 
nucleic acid, e.g., DNA or RNA. The method includes: contacting the candidate 
compound with an H. pylori nucleic acid and determining if the compound binds or 

1 5 otherwise interacts with an H. pylori polypeptide. Compounds which bind K pylori are 
candidates as activators or inhibitors of the bacterial life cycle. These assays can be 
performed in vitro or in vivo. 

The invention features, H. pylori polypeptides, preferably a substantially pure 
preparation of an K pylori polypeptide, or a recombinant H. pylori polypeptide. In 

20 preferred embodiments: the polypeptide has biological activity; the polypeptide has an 

amino acid sequence at least 60%, 80%, 90%, 95%, 98%, or 99% homologous to an amino 
acid sequence contained in SEQ ID NOs:l 15-228; the polypeptide has an amino acid 
sequence essentially the same as an amino acid sequence in SEQ ID NOs:l 15-228; the 
polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the polypeptide 

25 includes at least 5, preferably at least 1 0, more preferably at least 20, more preferably at 
least 50, 100, or 1 50 contiguous amino acids contained in SEQ ID NOs:l 1 5-228. 

In preferred embodiments: the K pylori polypeptide is encoded by a nucleic acid in 
SEQ ID NOs:l-l 14, or by a nucleic acid having at least 60%, 70%, 80%, 90%, 95%, 98%, 
or 99% homology with a nucleic acid of SEQ ID NOs:l-l 14. 

30 In a preferred embodiment, the subject K pylori polypeptide differs in amino acid 

sequence at 1, 2, 3, 5, 10 or more residues from a sequence in SEQ ID NOs:l 15-228. The 
differences, however, are such that: the H. pylori polypeptide exhibits an K pylori 
biological activity, e.g., the H. pylori polypeptide retains a biological activity of a naturally 
occurring H. pylori enzyme. 

35 In preferred embodiments, the polypeptide includes all or a fragment of an amino 

acid sequence contained in SEQ ID NOs:l 15-228; fused, in reading frame, to additional 
amino acid residues, preferably to residues encoded by genomic DNA 5' to the genomic 
DNA which encodes a sequence contained in SEQ ID NOs:l 15-228. 

In yet other preferred embodiments, the H. pylori polypeptide is a recombinant 

40 fusion protein having a first K pylori polypeptide portion and a second polypeptide 

portion, e.g., a second polypeptide portion having an amino acid sequence unrelated to H. 
pylori. The second polypeptide portion can be, e.g., any of glutathione-S-transferase, a 
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5 DNA binding domain, or a polymerase activating domain. In preferred embodiment the 
fusion protein can be used in a two-hybrid assay. 

Polypeptides of the invention include those which arise as a result of alternative 
transcription events, alternative RNA splicing events, and alternative translational and 
postradiational events. 

10 The invention includes an immunogen which includes an H. pylori polypeptide in 

an immunogenic preparation, the immunogen being capable of eliciting an immune 
response specific for said K pylori polypeptide, e.g., a humoral response, an antibody 
response, or a cellular response. In preferred embodiments, the immunogen comprises an 
antigenic determinant from a protein contained in SEQ ID NOs:l 15-228. 

1 5 In another aspect, the invention provides a substantially pure nucleic acid having a 

nucleotide sequence which encodes an K pylori polypeptide. In preferred embodiments: 
the encoded polypeptide has biological activity the encoded polypeptide has an amino acid 
sequence at least 60%, 80%, 90%, 95%, 98%, or 99% homologous to an amino acid 
sequence contained in SEQ ID NOs:l 15-228; the encoded polypeptide has an amino acid 

20 sequence essentially the same as an amino acid sequence in SEQ ID NOs:l 1 5-228; the 

encoded polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded 
polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more 
preferably at least 50, 100, or 150 contiguous amino acids contained in SEQ ID NOs: 1 15- 
228. 

25 In preferred embodiments: the nucleic acid is that of SEQ ID NOs: 1 -1 1 4; the 

nucleic acid is at least 60%, 70%, 80%, 90%, 95%, 98%, or 99% homologous with a 
nucleic acid sequence contained in SEQ ID NOs: 1-114. 

In a preferred embodiment, the encoded K pylori polypeptide differs in amino acid 
sequence at 1, 2, 3, 5, 10 or more residues, from a sequence in SEQ ID NOs:l 15-228. The 
30 differences, however, are such that: the H. pylori encoded polypeptide exhibits a H. pylori 
biological activity, e.g., the encoded H. pylori enzyme retains a biological activity of a 
naturally occurring K pylori. 

In preferred embodiments, the encoded polypeptide includes all or a fragment of an 
amino acid sequence contained in SEQ ID NOs: 1 15-228; fused, in reading frame, to 
35 additional amino acid residues, preferably to residues encoded by genomic DNA 5* to the 
genomic DNA which encodes a sequence contained in SEQ ID NOs:l 1 5-228. 

In preferred embodiments, the subject K pylori nucleic acid will include a 
transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or 
transcriptional enhancer sequence, operably linked to the K pylori gene sequence, e.g., to 
40 render the H. pylori gene sequence suitable for expression in a recombinant host cell. 

In yet a further preferred embodiment, the nucleic acid which encodes an H. pylori 
polypeptide of the invention, hybridizes under stringent conditions to a nucleic acid probe 
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5 corresponding to at least 12 consecutive nucleotides contained in SEQ ID NOs:l-l 14; more 
preferably to at least 20 consecutive nucleotides contained in SEQ ID NOs:l-l 14; more 
preferably to at least 40 consecutive nucleotides contained in SEQ ID NOs:l-l 14. 

In a preferred embodiment, the nucleic acid encodes a peptide which differs by at 
least one amino acid residue from the sequences shown in SEQ ID NOs:l 15-228. 
10 In a preferred embodiment, the nucleic acid differs by at least one nucleotide from a 

nucleotide sequence shown in SEQ ID NOs:l-l 14 which encodes amino acids shown in 
SEQ ID NOs: 115-228. 

In another aspect, the invention includes: a vector including a nucleic acid which 
encodes an H />v/pn-)ike polypeptide, * 8 » an H pylori polypeptide; a host cell transfected 
15 with the vector; and a method of producing a recombinant H. pylori-like polypeptide, e.g., 
an H. pylori polypeptide; including culturing the cell, e.g., in a cell culture medium, and 
isolating the H. pylori-tike polypeptide, e.g., an H. pylori polypeptide, e.g., from the cell or 
from the cell culture medium. 

In another aspect, the invention features, a purified recombinant nucleic acid having 
20 at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a sequence 
contained in SEQ ID NOs: 1-1 14. 

The invention also provides a probe or primer which includes a substantially 
purified oligonucleotide. The oligonucleotide includes a region of nucleotide sequence 
which hybridizes under stringent conditions to at least 10 consecutive nucleotides of sense 
25 or antisense sequence contained in SEQ ID NOs: 1 -1 1 4, or naturally occurring mutants 
thereof. In preferred embodiments, the probe or primer further includes a label group 
attached thereto. The label group can be, e.g., a radioisotope, a fluorescent compound, an 
enzyme, and/or an enzyme co-factor. Preferably the oligonucleotide is at least 10 and less 
than 20. 30. 50, 100, or 1 50 nucleotides in length. 
30 The invention involves nucleic acids, e.g., RNA or DNA, encoding a polypeptide of 

the invention. This includes double stranded nucleic acids as well as coding and antisense 
single strands. 

The H. pylori strain, from which genomic sequences have been sequenced, has been 
deposited in the American Type Culture Collection(ATCC # 55679) as strain HP-J99. 

35 The nucleic acid sequences of this invention may be obtained directly from the 

DNA of the above referenced H. pylori strain by using the polymerase chain reaction 
(PCR). See "PCR, A Practical Approach" (McPherson, Quirke, and Taylor, eds., IRL Press, 
Oxford, UK, 1991) for details about the PCR. High fidelity PCR can be used to ensure a 
faithful DNA copy prior to expression. In addition, amplified products can be checked by 

40 conventional sequencing methods. Clones carrying the desired sequences described in this 
invention may be obtained by screening the libraries by means of the PCR or by 
hybridization of synthetic oligonucleotide probes to filter lifts of the library colonies or 
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plaques as known in the art (see, eg, Sambrook et al., Molecular Cloning, A Laboratory 
Manual 2nd edition, 1 989, Cold Spring Harbor Press, NY). 

Nucleic acids isolated or synthesized in accordance with features of the present 
invention are useful, by way of example, without limitation, as probes, primers, capture 
ligands, antisense genes and for developing expression systems for the synthesis of proteins 
and peptides corresponding to such sequences. 

As probes, primers, capture ligands and antisense agents, the nucleic acid will 
normally comprise approximately twenty or more nucleotides for specificity as well as the 
ability to form stable hybridization products. 

Putative functions have been determined for several of the H pylori polypeptides nf 
the invention, as shown in Figure 1 . 

Accordingly, uses of the claimed H. pylori polypeptides in these identified functions 
are also within the scope of the invention. 

In addition, the present invention encompasses H. pylori polypeptides characterized 
as shown in Table 1 below, including: H. pylori outer membrane proteins, H. pylori 
periplasmic/secreted proteins, and other H. pylori surface proteins. Members of these 
groups were identified by BLAST homology searches. The H. pylori polypeptides 
identified in Table 1 are representative members of the groups identified above and are in 
no way limiting. Additional members of the groups can be identified within the H. pylori 
polypeptides disclosed herein by the methods known to those skilled in the art. 



SfcQ ID NO 
Outer Membrane Proteins 


Blast 
identifier 


Gene 
Symbol/Name 


Description 


7116626 (SEQ ID NO:223) 


P26093 


e(P4) 


e (P4) lipoprotein attached by 
lipid in H. influenza 


29479681 (SEQ ID NO: 179) 


P13036 


fecA 


Receptor in Iron (III) dicitrate 
transport E. coli 


36126938 (SEQ ID NO: 199) 
Periplasmic/Secreted Proteins 


L12346 


copB 


Major out. memb. prot. in M. 
catarrhalis 


30100332 (SEQ ID NO: 181) 
Other Surface Proteins 


P23847 


dppA 


Penplasmic dipeptide binding 
protein in E. coli 


4821082 (SEQIDNO:2I2) 


P08089 


M protein 


M protein of group A. 
Streptococci 


978477 (SEQIDNO:228) 


L28919 


FBP54 


Surface Ag of grp A. 
Streptococci binds flbronectin 



30 



Definitions 



A purified preparation or a substantially pure preparation of a polypeptide, as used 
herein, means a polypeptide that has been separated from other proteins, lipids, arid nucleic 
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5 acids with which it naturally occurs. Preferably, the polypeptide is also separated from 
substances, e.g., antibodies or gel matrix, e.g., polyacrylamide, which are used to purify it. 
Preferably, the polypeptide constitutes at least 1 0, 20, 50 70, 80 or 95% dry weight of the 
purified preparation. Preferably, the preparation contains: sufficient polypeptide to allow 
protein sequencing; at least 1. 10, or 100 ug of the polypeptide; at least 1, 10, or 100 mg of 

10 the polypeptide. 

A purified preparation of cells refers to, in the case of plant or animal cells, an in 
vitro preparation of cells and not an entire intact plant or animal. In the case of cultured 
cells or microbial cells, it consists of a preparation of at least 10% and more preferably 50% 
of the subject cells. 

1 5 The metabolism of a substance, as used herein, means any aspect of the, expression, 

function, action, or regulation of the substance. The metabolism of a substance includes 
modifications, e.g., covalent or non covalent modifications of the substance. The 
metabolism of a substance includes modifications, e.g., covalent or non covalent 
modification, the substance induces in other substances. The metabolism of a substance 

20 also includes changes in the distribution of the substance. The metabolism of a substance 
includes changes the substance induces in the distribution of other substances. 

A substantially pure nucleic acid, e.g., a substantially pure DNA, is a nucleic acid 
which is one or both of: not immediately contiguous with both of the coding sequences 
with which it is immediately contiguous (i.e., one at the 5' end and one at the 3' end) in the 

25 naturally-occurring genome of the organism from which the nucleic acid is derived; or 
which is substantially free of a nucleic acid sequence with which it occurs in the organism 
from which the nucleic acid is derived. The term includes, for example, a recombinant 
DNA which is incorporated into a vector, e.g., into an autonomously replicating plasmid or 
virus, or into the genomic DNA of a prokaryote or eukaryote, or which exists as a separate 

30 molecule (e.g., a cDNA or a genomic DNA fragment produced by PCR or restriction 
endonuclease treatment) independent of other DNA sequences. Substantially pure DNA 
also includes a recombinant DNA which is part of a hybrid gene encoding additional H. 
pylori DNA sequence. 

Homologous refers to the sequence similarity or sequence identity between two 

3 5 polypeptide molecules or between two nucleic acid molecules. When a position in both of 
the two compared sequences is occupied by the same base or amino acid monomer subunit, 
e.g., if a position in each of two DNA molecules is occupied by adenine, then the molecules 
are homologous at that position. The percent of homology between two sequences is a 
function of the number of matching or homologous positions shared by the two sequences 

40 divided by the number of positions compared x 100. For example, if 6 of 10, of the 

positions in two sequences are matched or homologous then the two sequences are 60% 
homologous. By way of example, the DNA sequences ATTGCC and TATGGC share 50% 
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5 homology. Generally, a comparison is made when two sequences are aligned to give 
maximum homology. 

The terms peptides, proteins, and polypeptides are used interchangeably herein. 
As used herein, the term "surface protein" refers to all surface accessible proteins, 
e.g. inner and outer membrane proteins, proteins adhering to the cell wall, and secreted ' 
1 0 proteins. 

As used herein, the term "transgene" means a nucleic acid sequence (encoding, e.g., 
one or more polypeptides), which is partly or entirely heterologous, i.e., foreign, to the " 
transgenic animal or cell into which it is introduced, or, is homologous to an endogenous 
gene of the transgenic animal or cell into which it is introduced, but which is desi«nrd to be 
15 inserted, or is inserted, into the cell's genome in such a way as to alter the genonJof the 
cell into which it is inserted (e.g., it is inserted at a location which differs from that of the 
natural gene or its insertion results in a knockout). A transgene can include one or more 
transcriptional regulatory sequences and any other nucleic acid, such as introns, that may be 
necessary for optimal expression of the selected nucleic acid, all operably linked to the 
20 selected nucleic acid, and may include an enhancer sequence. 

As used herein, the term "transgenic cell" refers to a cell containing a transgene. 
As used herein, a "transgenic animal" is any animal in which one or more, and 
preferably essentially all, of the cells of the animal includes a transgene. The transgene can 
be introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 
by way of deliberate genetic manipulation, such as by microinjection or by infection with a 
recombinant virus. This molecule may be integrated within a chromosome, or it may be 
extrachromosomally replicating DNA. 

As used herein, the term "cell-specific promoter" means a DNA sequence that 
serves as a promoter, i.e., regulates expression of a selected DNA sequence operably 
linked to the promoter, and which effects expression of the selected DNA sequence in 
specific cells of a tissue. The term also covers so-called "leaky" promoters, which 
regulate expression of a selected DNA primarily in one tissue, but cause expression in 
other tissues as well. 

A polypeptide has H. pylori biological activity if it has one, two, three, and 
preferably more of the following properties: (1) if when expressed in the course of an H. 
pylori infection, it can promote, or mediate the attachment of//, pylori to a cell (2) it has an 
enzymatic activity characteristic of an H. pylori protein (3) or the gene which encodes it 
can rescue a lethal mutation in an H. pylori gene. A polypeptide has biological activity if it 
is an antagonist, agonist, or super-agonist of a polypeptide having one of the above-listed 
40 properties. 

Misexpression, as used herein, refers to a non-wild type pattern of gene expression. 
It includes: expression at non-wild type levels, i.e., over or under expression; a pattern of 
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5 expression that differs from wild type in terms of the time or stage at which the gene is 
expressed, e.g., increased or decreased expression (as compared with wild type) at a 
predetermined developmental period or stage; a pattern of expression that differs from wild 
type in terms of decreased expression (as compared with wild type) in a predetermined cell 
type or tissue type; a pattern of expression that differs from wild type in terms of the 

1 0 splicing size, amino acid sequence, post-transitional modification, or biological activity of 
the expressed polypeptide; a pattern of expression that differs from wild type in terms of 
the effect of an environmental stimulus or extracellular stimulus on expression of the gene, 
e.g., a pattern of increased or decreased expression (as compared with wild type) in the 
presence of an increase or decrease in the strength of the stimulus. 

1 5 As used herein, "host cells" and other such terms denoting microorganisms or 

higher eukaryotic cell lines cultured as unicellular entities refers to cells which can become 
or have been used as recipients for a recombinant vector or other transfer DNA, and include 
the progeny of the original cell which has been transfected. It is understood by individuals 
skilled in the art that the progeny of a single parental cell may not necessarily be 

20 completely identical in genomic or total DNA compliment to the original parent, due to 
accident or deliberate mutation. 

As used herein, the term "control sequence" refers to a nucleic acid having a base 
sequence which is recognized by the host organism to effect the expression of encoded 
sequences to which they are ligated. The nature of such control sequences differs depending 

25 upon the host organism; in prokaryotes, such control sequences generally include a 

promoter, ribosomal binding site and terminators; in eukaryotes, generally such control 
sequences include promoters, terminators and in some instances, enhancers. The term 
control sequence is intended to include at a minimum, all components whose presence is 
necessary for expression, and may also include additional components whose presence is 

30 advantageous, for example, leader sequences. 

As used herein, the term "operably linked" refers to sequences joined or ligated to 
function in their intended manner, For example, a control sequence is operably linked to 
coding sequence by ligation in such a way that expression of the coding sequence is 
achieved under conditions compatible with the control sequence and host cell. 

35 An "open reading frame", also referred to herein as ORF, is a region of nucleic acid 

which encodes a peptide. This region may represent a portion of a coding sequence or a 
total sequence. 

As used herein, a "coding sequence" is a nucleic acid sequence which is transcribed 
into messenger RNA and/or translated into a peptide when placed under the control of 
40 appropriate regulatory sequences. The boundaries of the coding sequence are determined by 
a translation start codon at the five prime terminus and a translation stop code at the three 
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prime terminus. A coding sequence can include but is not limited to messenger RNA, 
synthetic DNA, or recombinant nucleic acid sequences. 

A "gene product" is a protein or structural RNA which is specifically encoded for 
by a gene. 

As used herein, the term "probe" refers to a nucleic acid, peptide or other chemical 
entity which specifically binds to a molecule of interest. Probes are often associated with or 
capable of associating with a label. A label is a chemical moiety capable of detection. 
Typical labels comprise dyes, radioisotopes, luminescent and chemiluminescent moieties, 
fluorophores, enzymes, precipitating agents, amplification sequences, and the like. 
Similarly, a nucleic acid, peptide or other chemical entity which specifically binds to a 
molecule of interest and immobilizes such molecule is referred herein as a "capture ligand". 
Capture ligands are typically associated with or capable of associating with a support such 
as nitro-cellulose, glass, nylon membranes, beads, particles and the like. The specificity of 
hybridization is dependent on conditions such as the base pair composition of the 
nucleotides, and the temperature and salt concentration of the reaction. These conditions are 
readily discernable to one of ordinary skill in the art using routine experimentation. 

The experimental manipulation of such conditions has been well described in the 
literature including such books as Molecular Cloning; A Laboratory Manual Sambrook, J., 
Fritsch, E.F., Maniatis, T. } Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 2nd 
ed.(1989). 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of chemistry, molecular biology, microbiology, recombinant 
DNA, and immunology, which are within the skill of the art. Such techniques are 
explained fully in the literature. See e.g., Sambrook, Fritsch, and Maniatis, Molecular 
Cloning; Laboratory Manual 2nd ed. ( 1 989); DNA Cloning, Volumes I and II (D.N Glover 
ed. 1985); Oligonucleotide Synthesis (M.J. Gait ed, 1984); Nucleic Acid Hybridization 
(B.D. Hames & SJ. Higgins eds. 1984); the series, Methods in Enzymoloqy (Academic 
Press, Inc.), particularly VoL 154 and Vol. 155 (Wu and Grossman, eds.) and PCR-A 
Practical Approach (McPherson, Quirke, and Taylor, eds., 1991). 

Probes 

A nucleic acid isolated or synthesized in accordance with SEQ ID NOs:l-l 14 can 
be used as a probe to specifically detect K pylori. With the sequence information set forth 
in the present application, sequences of twenty or more nucleotides are identified which 
provide the desired inclusivity and exclusivity with respect to H. pylori, and extraneous 
nucleic acid sequences likely to be encountered during hybridization conditions. More 
preferably, the sequence will comprise at least twenty to thirty nucleotides to convey 
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5 stability to the hybridization product formed between the probe and the intended target 
molecules. 

Sequences larger than 1000 nucleotides in length are difficult to synthesize but can 
be generated by recombinant DNA techniques. Individuals skilled in the art will readily 
recognize that the nucleic acid sequences, for use as probes, can be provided with a label to 
1 0 facilitate detection of a hybridization product. 

Nucleic acid isolated and synthesized in accordance with SEQ ID NOs:l-l 14 may 
also be useful as probes to detect homologous regions (especially homologous genes) of 
other Helicobacter species using relaxed stringency hybridization conditions, as will be 
obvious to anybody skilled in the art. 

15 

Capture Lieand 

For use as a capture ligand, the nucleic acid selected in the manner described above 
with respect to probes, can be readily associated with a support. The manner in which 
nucleic acid is associated with supports is well known. Nucleic acid having twenty or more 

20 nucleotides in a sequence contained in SEQ ID NOs: 1-114 have utility to separate H. pylori 
nucleic acid from the nucleic acid of each other and other organisms. Nucleic acid having 
twenty or more nucleotides in a sequence shown in SEQ ID NOs: 1-1 14 may also have 
utility to separate other Helicobacter species from each other and from other organisms. 
Preferably, the sequence will comprise at least twenty nucleotides to convey stability to the 

2 5 hybridization product formed between the probe and the intended target molecules. 
Sequences larger than 1000 nucleotides in length are difficult to synthesize but can be 
generated by recombinant DNA techniques. 

Primers 

30 Nucleic acid isolated or synthesized in accordance with the sequences described 

herein have utility as primers for the amplification of H. pylori nucleic acid. These nucleic 
acids may also have utility as primers for the amplification of nucleic acid sequences in 
other Helicobacter species. With respect to polymerase chain reaction (PCR) techniques, 
nucleic acid sequences of > 10-15 nucleotides contained in SEQ ID NOs:l-l 14 have utility 

35 in conjunction with suitable enzymes and reagents to create copies of H. pylori nucleic 
acid. More preferably, the sequence will comprise twenty or more nucleotides to convey 
stability to the hybridization product formed between the primer and the intended target 
molecules. Binding conditions of primers greater than 100 nucleotides are more difficult tc 
control to obtain specificity. High fidelity PCR can be used to ensure a faithful DNA copy 

40 prior to expression. In addition, amplified products can be checked by conventional 
sequencing methods. 
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The copies can be used in diagnostic assays to detect specific sequences, including 
genes from H. pylori and/or other Helicobacter species. The copies can also be 
incorporated into cloning and expression vectors to generate polypeptides corresponding to 
the nucleic acid synthesized by PCR, as will be described in greater detail below. 

Antisense 

Nucleic acid or nucleic acid-hybridizing derivatives isolated or synthesized in 
accordance with the sequences described herein have utility as antisense agents to prevent 
the expression of//, pylori genes. These sequences may also have utility as antisense 
agents to prevent expression of genes of other Helicobacter species. 

Nucleic acid or derivatives corresponding to H. pylori nucleic acid sequences is 
loaded into a suitable carrier such as a liposome or bacteriophage for introduction into 
bactenal cells. For example, a nucleic acid having twenty or more nucleotides is capable of 
binding to bacteria nucleic acid or bacteria messenger RNA. Preferably, the antisense 
nucleic acid is comprised of 20 or more nucleotides to provide necessary stability of a 
hybridization product of non-naturally occurring nucleic acid and bacterial nucleic acid 
and/or bacterial messenger RNA. Nucleic acid having a sequence greater than 1000 
nucleotides in length is difficult to synthesize but can be generated by recombinant DNA 
techniques. Methods for loading antisense nucleic acid in liposomes is known in the art as 
exemplified by U.S. Patent 4,241,046 issued December 23, 1 980 to Papahadjopoulos et al. 

Expressing H. nvlnri fienpg 

The function of a specific gene or operon can be ascertained by expression in a 
bacterial strain under conditions where the activity of the gene product(s) specified by the 
gene or operon in question can be specifically measured. Alternatively, a gene product may 
be produced in large quantities in an expressing strain for use as an antigen, an industrial 
reagent, for structural studies, etc. This expression could be accomplished in a mutant 
strain which lacks the activity of the gene to be tested, or in a strain that does not produce 
the same gene produces). This includes, but is not limited to other Helicobacter strains and 
other bacterial strains such as E. coli, Norcardia, Corynebacterium, and Streptomyces 
species. In some cases the expression host will utilize the natural Helicobacter promoter 
whereas in others, it will be necessary to drive the gene with a promoter sequence derived 
from the expressing organism (e.g., an E coli beta-galactosidase promoter for expression in 
E. coli). 

To express a gene product using the natural H. pylori promoter, a procedure such as 
the following is used. A restriction fragment containing the gene of interest, together with 
its associated natural promoter element and regulatory sequences (identified using the DNA 
sequence data) is cloned into an appropriate recombinant plasmid containing the following 
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5 components: an origin of replication that functions in the host organism, and an appropriate 
selectable marker. This can be accomplished by a number of procedures known to those 
skilled in the art. It is most preferably done by cutting the plasmid and the fragment to be 
cloned with the same restriction enzyme to produce compatible ends that can be ligated to 
join the two pieces together. The recombinant plasmid is introduced into the host organism 

1 0 by electroporation and cells containing the recombinant plasmid are identified by selection 
for the marker on the plasmid. Expression of the desired gene product is detected using an 
assay specific for that gene product. 

In the case of a gene that requires a different promoter, the body of the gene (coding 
sequence) is specifically excised and cloned into an appropriate expression plasmid. This 

1 5 subcloning can be done by several methods, but is most easily accomplished by PCR 
amplification of a specific fragment and ligation into an expression plasmid after treating 
the PCR product with a restriction enzyme or exonuclease to create suitable ends for 
cloning. 

20 Expressed Genes in Therapeutics 

Nucleic acid isolated or synthesized in accordance with the sequences described 
herein have utility to generate proteins and peptides. The nucleic acid exemplified in SEQ 
ID NOs:l-l 14 or fragments of said nucleic acid sequences encoding immunogenic portions 
ofH. pylori proteins (SEQ ID NO: 115-228) can be cloned into suitable vectors or used to 
25 isolate nucleic acid. The isolated nucleic acid is combined with suitable DNA linkers and 
cloned into a suitable vector. 

The host cell may be any procaryotic or eucaryotic cell. For example, an H. pylori 
peptide may be expressed in bacterial cells such as E. coli, insect cells (baculovirus), yeast, 
or mammalian cells such as Chinese hamster ovary cell (CHO).. Other suitable host cells 
30 are known to those skilled in the art. 

Expression in eucaryotic cells such as mammalian, yeast, or insect cells can lead to 
partial or complete glycosylation and/or formation of relevant inter- or intra-chain disulfide 
bonds of a recombinant peptide product. Examples of vectors for expression in yeast 
S. cerivisae include pYepSecl (Baldari. et al., (1987) Embo J. 6:229-234), pMFa (Kurjan 
3 5 and Herskowitz, ( 1 982) Cell 30:933-943), P JRY88 (Schultz et al., ( 1 987) Gene 54-1 13- 
123), and P YES2 (Invitrogen Corporation, San Diego, CA). Baculovirus vectors available 
for expression of proteins in cultured insect cells (SF 9 cells) include the pAc series (Smith 
et al., (1983) Mol. Cell Biol. 3:2156-2165) and the pVL series (Lucklow, V.A.. and 
Summers, M.D., (1989) Virology 170:31-39). Generally, COS cells (Gluzman, Y., (1981) 
40 Cell 23 : 1 75- 1 82) are used in conj unction with such vectors as pCDM 8 (Aruffo, A. and 
Seed, B., (1987) Proc. Natl. Acad. Sci. USA £4:8573-8577) for transient 
amplification/expression in mammalian cells, while CHO (dhfr Chinese Hamster Ovary) 
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cells are used with vectors such as pMT2PC (Kaufman et al. (1 987), EMBO J. 6: 1 87-1 95) 
for stable amplification/expression in mammalian cells. Vector DNA can be introduced 
into mammalian cells via conventional techniques such as calcium phosphate or calcium 
chloride co-precipitation, DEAE-dextran-mediated transfection, or electroporation. 
Suitable methods for transforming host cells can be found in Sambrook et al. ( Molecular 
Cloning: A Laboratory Manual 2nd Edition, Cold Spring Harbor Laboratory press 
(1989)), and other laboratory textbooks. 

Expression in procaryotes is most often carried out in E. coli with either fusion or 
non-fusion inducible expression vectors. Fusion vectors usually add a number of NH-> 
terminal amino acids to the expressed target gene. These NH 2 terminal amino acids often 
are referred to as a reporter group. Such reporter groups usually serve two purposes: 1 ) to 
increase the solubility of the target recombinant protein; and 2) to aid in the purification of 
the target recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the reporter 
group and the target recombinant protein to enable separation of the target recombinant 
protein from the reporter group subsequent to purification of the fusion protein. Such 
enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and 
enterokinase. Typical fusion expression vectors include pGEX (Amrad Corp., Melbourne, 
Australia), pMAL (New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, 
Piscataway, NJ) which fuse glutathione S-transferase, maltose E binding protein, or protein 
A, respectively, to the target recombinant protein. A preferred reporter group is poly(His), 
which may be fused to the amino or carboxy terminus of the protein and which renders the 
recombinant fusion protein easily purifiable by metal chelate chromatography. 

Inducible non-fusion expression vectors include pTrc (Amann et al., (1988) Gene 
69:30 1 -3 1 5) and pETl 1 d (Studier et al., Gene Expression Technology: Methods in 
EnzvmoloKY JM. Academic Press, San Diego, California (1990) 60-89). While target gene 
expression relies on host RNA polymerase transcription from the hybrid trp-lac fusion 
promoter in pTrc, expression of target genes inserted into pETl Id relies on transcription 
from the T7 gnlO-lac 0 fusion promoter mediated by coexpressed viral RNA polymerase 
(T7 gnl). This viral polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) 
from a resident X prophage harboring a T7 gnl under the transcriptional control of the 
lacUV 5 promoter. 

For example, a host cell transfected with a nucleic acid vector directing expression 
of a nucleotide sequence encoding an H. pylori peptide can be cultured under appropriate 
conditions to allow expression of the peptide to occur. The peptide may be secreted and 
isolated from a mixture of cells and medium containing the peptide. Alternatively, the 
peptide may be retained cytoplasmically and the cells harvested, lysed and the protein 
isolated. A cell culture includes host cells, media and other byproducts. Suitable media for 
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5 cell culture are well known in the art. Peptides of the invention can be isolated from cell 
culture medium, host cells, or both using techniques known in the art for purifying proteins 
including ion-exchange chromatography, gel filtration chromatography, ultrafiltration, 
electrophoresis, and immunoaffinity purification with antibodies specific for such peptides. 
Additionally, in many situations, peptides can be produced by chemical cleavage of a 

1 0 native protein (e.g., tryptic digestion) and the cleavage products can then be purified by 
standard techniques. 

In the case of membrane bound proteins, these can be isolated from a host cell by 
contacting a membrane-associated protein fraction with a detergent forming a solubilized 
complex, where the membrane-associated protein is no longer entirely embedded in the 

1 5 membrane fraction and is solubilized at least to an extent which allows it to be 

chromatographically isolated from the membrane fraction. Several different criteria are 
used for choosing a detergent suitable for solubilizing these complex. For example, one 
property considered is the ability of the detergent to solubilize the H. pylori protein within 
the membrane fraction at minimal denaturation of the membrane-associated protein 

20 allowing for the activity or functionality of the membrane-associated protein to return upon 
reconstitution of the protein. Another property considered when selecting the detergent is 
the critical micells concentration (CMC) of the detergent in that the detergent of choice 
preferably has a high CMC value allowing for ease of removal after reconstitution. A third 
property considered when selecting a detergent is the hydrophobicity of the detergent. 

25 Typically, membrane-associated proteins are very hydrophobic and therefore detergents 
which are also hydrophobic, e.g. the triton series, would be useful for solubilizing the 
hydrophobic proteins. Another property important to a detergent can be the capability of 
the detergent to remove the //. pylori protein with minimal protein-protein interaction 
facilitating further purification. A fifth property of the detergent which should be 

30 considered is the charge of the detergent. For example, if it is desired to use ion exchange 
resins in the purification process then preferably detergent should be an uncharged 
detergent. Chromatographic techniques which can be used in the final purification step are 
known in the art and include hydrophobic interaction, lectin affinity, ion exchange, dye 
affinity and immunoaffinity. 

3 5 One strategy to maximize recombinant H. pylori peptide expression in E. coli is to 

express the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, S., Pene Expression Technology: Methods in 
Enzvmologv 185 . Academic Press, San Diego, California (1990) 1 19-128). Another 
strategy would be to alter the nucleic acid encoding H. pylori peptide to be inserted into an 

40 expression vector so that the individual codons for each amino acid would be those 

preferentially utilized in highly expressed E. coli proteins (Wada et al., (1992) Nuc. Acids 
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Res t 20:21 1 1-21 18). Such alteration of nucleic acids of the invention can be carried out by 
standard DNA synthesis techniques. 

The nucleic acids of the invention can also be chemically synthesized using 
standard techniques. Various methods of chemically synthesizing polydeoxynucleotides 
are known, including solid-phase synthesis which, like peptide synthesis, has been fully 
automated in commercially available DNA synthesizers (See, e.g., Itakura et al. U.S Patent 
No. 4,598,049; Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. Patent Nos. 
4,401,796 and 4,373,071, incorporated by reference herein). 

Drug Screening Assay*: 

By making available purified and recombinant H. pylori polypeptides, the present 
invention provides assays which can be used to screen for drugs which are either agonists 
or antagonists of the normal cellular function, in this case, of the subject H. pylori 
polypeptides, or of their role in intracellular signaling. Such inhibitors or potentiators may 
be useful as new therapeutic agents to combat H. pylori infections in humans. A variety of 
assay formats will suffice and, in light of the present inventions, will be comprehended by 
the skilled artisan. 

In many drug screening programs which test libraries of compounds and natural 
extracts, high throughput assays are desirable in order to maximize the number of 
compounds surveyed in a given period of time. Assays which are performed in cell-free 
systems, such as may be derived with purified or semi-purified proteins, are often preferred 
as "primary" screens in that they can be generated to permit rapid development and 
relatively easy detection of an alteration in a molecular target which is mediated by a test 
compound. Moreover, the effects of cellular toxicity and/or bioavailability of the test 
compound can be generally ignored in the in vitro system, the assay instead being focused 
primarily on the effect of the drug on the molecular target as may be manifest in an 
alteration of binding affinity with other proteins or change in enzymatic properties of the 
molecular target. Accordingly, in an exemplary screening assay of the present invention, 
the compound of interest is contacted with an isolated and purified H. pylori polypeptide. 

Screening assays may be constructed in vitro with a purified H. pylori enzyme such 
that the action of the enzyme produces an easily detectable reaction product. The efficacy 
of the compound can be assessed by generating dose response curves from data obtained 
using various concentrations of the test compound. Moreover, a control assay can also be 
performed to provide a baseline for comparison. Suitable products include those with 
distinctive absorption, fluorescence, or chemi-luminescence properties, for example, 
because detection may be easily automated. A variety of synthetic or naturally occurring 
compounds may be tested in the assay to identify those which inhibit or potentiate the 
activity of the H. pylori enzyme. Some of these active compounds may directly, or with 
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5 chemical alterations to promote membrane permeability or solubility, also inhibit or 
potentiate the same enzymatic activity in whole, live H. pylori cells. 

Antibodies 

The invention also includes antibodies specifically reactive with the subject H. 

1 0 pylori-like polypeptide. Anti-protein/anti-peptide antisera or monoclonal antibodies can be 
made by standard protocols (See, for example, Antibodies: A Laboratory Manual ed. by 
Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal such as a mouse, a 
hamster or rabbit can be immunized with an immunogenic form of the peptide. Techniques 
for conferring immunogenic ity on a protein or peptide include conjugation to carriers or 

1 5 other techniques well known in the art. An immunogenic portion of the subject H. pylori 
polypeptide can be administered in the presence of adjuvant. The progress of immunization 
can be monitored by detection of antibody titers in plasma or serum. Standard ELISA or 
other immunoassays can be used with the immunogen as antigen to assess the levels of 
antibodies. In a preferred embodiment, the subject antibodies are immunospecific for 

20 antigenic determinants of the H. pylori polypeptides of the invention, e.g. antigenic 

determinants of a polypeptide shown in SEQ ID NOs:l 1 5-228 or a closely related human or 
non-human mammalian homolog (e.g. 90% percent homologous, more preferably at least 
95 percent homologous). In yet a further preferred embodiment of the present invention, 
the wAi-H. pylori antibodies do not substantially cross react (i.e. react specifically) with a 

25 protein which is: e.g., less than 80% percent homologous to a sequence shown in SEQ ID 
NOs: 1 1 5-228. By "not substantially cross react", it is meant that the antibody has a binding 
affinity for a non-homologous protein which is less than 10 percent, more preferably less 
than 5 percent, and even more preferably less than 1 percent, of the binding affinity for a 
protein contained in SEQ ID NOs: 11 5-228. In a most preferred embodiment, there is no 

30 crossreactivity between bacterial and mammalian antigens. 

The term antibody as used herein is intended to include fragments thereof which are 
also specifically reactive with H. pylori polypeptides. Antibodies can be fragmented using 
conventional techniques and the fragments screened for utility in the same manner as 
described above for whole antibodies. For example, F(ab*)2 fragments can be generated by 

3 5 treating antibody with pepsin. The resulting F(ab')2 fragment can be treated to reduce 
disulfide bridges to produce Fab' fragments. The antibody of the present invention is 
further intended to include bispecific and chimeric molecules having an anti-//. pylori 
portion. 

Both monoclonal and polyclonal antibodies (Ab) directed against H. pylori 
40 polypeptides or H. pylori polypeptide variants, and antibody fragments such as Fab' and 
F(ab , ) 2 , can be used to block the action of//, pylori polypeptide and allow the study of the 
role of a particular H. pylori polypeptide of the present invention in aberrant or unwanted 
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intracellular signaling, as well as the normal cellular function of the H. pylori and by 
microinjection of and-//, pylori polypeptide antibodies of the present invention. 

Antibodies which specifically bind H. pylori epitopes can also be used in 
immunohistochemical staining of tissue samples in order to evaluate the abundance and 
pattern of expression of H. pylori antigens. Anti-//. pylori polypeptide antibodies can be 
used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate H. 
pylori levels in tissue or bodily fluid as part of a clinical testing procedure. Likewise, the 
ability to monitor H. pylori polypeptide levels in an individual can allow determination of 
the efficacy of a given treatment regimen for an individual afflicted with such a disorder. 
The level of an H. pylon polypeptide can be measured in cells found in bodily fluid, such 
as in urine samples or can be measured in tissue, such as produced by gastric biopsy. 
Diagnostic assays using anti-//. pylori antibodies can include, for example, immunoassays 
designed to aid in early diagnosis of//, pylori infections. The present invention can also be 
used as a method of detecting antibodies contained in samples from individuals infected by 
this bacterium using specific H. pylori antigens. 

Another application of anti-//. pylori polypeptide antibodies of the present invention 
is in the immunological screening of cDNA libraries constructed in expression vectors such 
as A.gtl 1 , Xgtl 8-23, AZAP, and XORF8. Messenger libraries of this type, having coding 
sequences inserted in the correct reading frame and orientation, can produce fusion 
proteins. For instance, Xgtl 1 will produce fusion proteins whose amino termini consist of 
6-galactosidase amino acid sequences and whose carboxy termini consist of a foreign 
polypeptide. Antigenic epitopes of a subject H. pylori polypeptide can then be detected 
with antibodies, as, for example, reacting nitrocellulose filters lifted from infected plates 
with anti-//. pylori polypeptide antibodies. Phage, scored by this assay, can then be 
isolated from the infected plate. Thus, the presence of//, pylori gene homologs can be 
detected and cloned from other species, and alternate isoforms (including splicing variants) 
can be detected and cloned. 

Vaccines 

The present invention also includes vaccine compositions for protection against 
infection by H. pylori or for treatment of//, pylori infection, a gram-negative spiral 
microaerophilic bacterium. In one embodiment, the vaccine compositions contain 
immunogenic surface proteins from H. pylori, or portion thereof, and a pharmaceutical^ 
acceptable carrier. Nucleic acids within the scope of the invention are exemplified by the 
nucleic acids shown in SEQ ID NOs:l-1 14 and which encode H. pylori surface proteins 
shown in SEQ ID NOs: 1 1 5-228. However, any nucleic acid encoding an immunogenic H. 
pylori protein, or portion thereof, which is capable of expression in a cell, can be used in 
the present invention. These vaccines can have therapeutic and prophylactic utilities. 
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5 Another aspect of the present invention provides vaccine compositions for 

protection against infection by K pylori or for treatment of H. pylori infection, which 
contain a modified immunogenic K pylori protein or portion thereof, and a 
pharmaceutical^ acceptable carrier. It is possible to modify the structure of a K pylori 
protein or peptide for such purposes as increasing solubility, enhancing stability (e.g., shelf 

1 0 life ex vivo and resistance to proteolytic degradation in vivo). A modified H, pylori protein 
or peptide can be produced in which the amino acid sequence has been altered, such as by 
amino acid substitution, deletion, or addition. 

Another example of modification of an K pylori peptide is substitution of cysteine 
residues preferably with alanine, serine, threonine, leucine or glutamic acid residues to 

15 minimize dimerization via disulfide linkages. In addition, amino acid side chains of 
fragments of the protein of the invention can be chemically modified. Another 
modification is cyclization of the peptide. 

In order to enhance stability and/or reactivity, an H. pylori protein or peptide can be 
modified to incorporate one or more polymorphisms in the amino acid sequence of the 

20 protein resulting from any natural allelic variation. Additionally, D-amino acids, non- 
natural amino acids, or non-amino acid analogs can be substituted or added to produce a 
modified protein within the scope of this invention. Furthermore, an Kpylori protein can 
be modified using polyethylene glycol (PEG) according to the method of A. Sehon and co- 
workers (Wie et al., supra) to produce a protein conjugated with PEG. In addition, PEG 

25 can be added during chemical synthesis of the protein. Other modifications of H. pylori 

proteins include reduction/alkylation (Tarr, Methods of Protein Microcharactehzation, J. E. 
Silver ed., Humana Press, Clifton NJ 155-194 (1986)); acylation (Tarr, supra); chemical 
coupling to an appropriate carrier (Mishell and Shiigi, eds. Selected Methods in Cellular 
Immunology, WH Freeman, San Francisco, CA (1980), U.S. Patent 4,939,239; or mild 

30 formalin treatment (Marsh, (1971) Int. Arch of Allergy andAppl Immunol. 4J_: 199 - 215). 

To facilitate purification and potentially increase solubility of an H. pylori protein 
or peptide, it is possible to add an amino acid fusion moiety to the peptide backbone. For 
example, hexa-histidine can be added to the protein for purification by immobilized metal 
ion affinity chromatography (Hochuli, E. et al., (1988) Bio/Technology, 6: 1321 - 1325). In 

35 addition, to facilitate isolation of peptides free of irrelevant sequences, specific 

endoprotease cleavage sites can be introduced between the sequences of the fusion moiety 
and the peptide. 

To potentially aid proper antigen processing of T cell epitopes within an H. pylori 
protein of the invention, canonical protease sensitive sites can be engineered between 
40 regions, each comprising at least one T cell epitope via recombinant or synthetic methods. 
For example, charged amino acid pairs, such as KK or RR, can be introduced between 
regions within a protein or fragment during recombinant construction thereof. The 
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resulting peptide can be rendered sensitive to cleavage by cathepsin and/or other trypsin- 
like enzymes which would generate portions of the protein containing one or more T cell 
epitopes. In addition, such charged amino acid residues can result in an increase in the 
solubility of the peptide. 

Site-directed mutagenesis of a nucleic acid encoding an H. pylori protein can be 
used to modify the structure of the peptide by methods known in the art. Such methods 
may, among others, include polymerase chain reaction (PCR) with oligonucleotide primers 
bearing one or more mutations (Ho et al., ( 1 989) Gene, 77: 5 1 - 59) or total synthesis of 
mutated genes (Hostomsky, Z. et al., (1989) Biochem. Biophys. Res. Comm, 16k 1056 - 
1063). To enhance recombinant protein expression, the aforementioned methods can be 
applied to change the codons present in the cDNA sequence of the invention to those 
preferentially utilized by the host cell in which the recombinant protein is being expressed 
(Wada et al., supra). An extensive discussion of mutagenesis protocols is provided in the 
"Production of Fragments and Analogs" section herein. 

Another aspect of the invention provides a vaccine composition for protection 
against infection by K pylori which contains an immunogenic fragment of an K pylori 
protein or portion thereof, and a pharmaceutically acceptable carrier. Preferred fragments 
include peptides of at least about 10 amino acid residues in length, preferably about 10-20 
amino acid residues in length, and more preferably about 12-16 amino acid residues in 
length. 

Immunogenic peptides of the invention can be obtained, for example, by screening 
peptides recombinantly produced from the corresponding fragment of the nucleic acid 
encoding the full-length H. pylori protein. In addition, fragments can be chemically 
synthesized using techniques known in the art such as conventional Mcrrifield solid phase 
f-Moc or t-Boc chemistry. For example, an K pylori protein may be arbitrarily divided 
into fragments of desired length with no overlap of the fragments, or preferably divided 
into overlapping fragments of a desired length. The fragments can be produced 
(recombinantly or by chemical synthesis) and tested to identify those peptides having the 
ability to induce a T cell response, such as stimulation (proliferation, cytokine secretion). 
An extensive discussion of peptide analogs and fragments is provided in the "Production of 
Fragments and Analogs" section herein. 

In one embodiment, immunogenic H. pylori fragments can be identified by the 
ability of the peptide to stimulate T cells. Peptides which stimulate T cells, as determined 
by, for example, T cell proliferation or cytokine secretion are defined herein as comprising 
at least one T cell epitope. T cell epitopes are believed to be involved in initiation and 
perpetuation of the immune response to the protein allergen which is responsible for the 
clinical symptoms of allergy. These T cell epitopes are thought to trigger early events at 
the level of the T helper cell by binding to an appropriate HLA molecule on the surface of 
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5 an antigen presenting cell, thereby stimulating the T cell subpopulation with the relevant T 
cell receptor for the epitope. These events lead to T cell proliferation, lymphokine 
secretion, local inflammatory reactions, recruitment of additional immune cells to the site 
of antigen/T cell interaction, and activation of the B cell cascade, leading to the production 
of antibodies. One isotype of these antibodies, IgE, is fundamentally important to the 

1 0 development of allergic symptoms and its production is influenced early in the cascade of 
events at the level of the T helper cell, by the nature of the lymphokines secreted. A T cell 
epitope is the basic element, or smallest unit of recognition by a T cell receptor, where the 
epitope comprises amino acids essential to receptor recognition. Amino acid sequences 
which mimic those of the T cell epitopes and which modify the allergic response to protein 

1 5 allergens are within the scope of this invention. 

In another embodiment, immunogenic components of the invention are identified 
through genomic vaccination. The basic protocol is based on the idea that expression 
libraries consisting of all or parts of a pathogen genome, e.g., an H. pylori genome, can 
confer protection when used to genetically immunize a host. This expression library 

20 immunization (ELI) is analogous to expression cloning and involves reducing a genomic 
expression library of a pathogen, e.g., K pylori, into plasmids that can act as genetic 
vaccines. The plasmids can also be designed to encode genetic adjuvants which can 
dramatically stimulate the humoral response. These genetic adjuvants can be introduced at 
remote sites and act as well extracelluraly as intracellularly. 

25 This is a new approach to vaccine production that has many of the advantages of 

live/attenuated pathogens but no risk of infection. An expression library of pathogen DNA 
is used to immunize a host thereby producing the effects of antigen presentation of a live 
vaccine without the risk. For example, in the present invention, random fragments from the 
H. pylori genome or from cosmid or plasmid clones, as well as PCR products from genes 

30 identified by genomic sequencing, can be used to immunize a host. The feasibility of this 
approach has been demonstrated with Mycoplasma pulmonis (Barry et al., Nature 377:632- 
635, 1 995), where even partial expression libraries of Mycoplasma pulmonis, a natural 
pathogen in rodents, provided protection against challenge from the pathogen. 

ELI is a technique that allows for production of a non-infectious multipartite 

35 vaccine, even when little is known about pathogen's biology, because ELI uses the immune 
system to screen candidate genes. Once isolated, these genes can be used as genetic 
vaccines or for development of recombinant protein vaccines. Thus, ELI allows for 
production of vaccines in a systematic, largely mechanized fashion. 

Screening peptides for those which are immunogenic can be accomplished using 

40 one or more of several different assays. For example, in vitro, peptide T cell stimulatory 
activity is assayed by contacting a peptide known or suspected of being immunogenic with 
an antigen presenting cell which presents appropriate MHC molecules in a T ceil culture. 
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Presentation of an immunogenic K pylori peptide in association with appropriate MHC 
molecules, to T cells, in conjunction with the necessary costimulation, has the effect of 
transmitting a signal to the T cell that induces the production of increased levels of 
cytokines, particularly of interleukin-2 and interleukin-4. The culture supernatant can be 
obtained and assayed for interleukin-2 or other known cytokines. For example, any one of 
several conventional assays for interleukin-2 can be employed, such as the assay described 
in Proc. Natl. Acad Sci USA, 86: 1333 (1989) the pertinent portions of which are 
incorporated herein by reference. A kit for an assay for the production of interferon is also 
available from Genzyme Corporation (Cambridge, MA). 

Alternatively, a common assay for T cell proliferation entails measuring tritiated 
thymidine incorporation. The proliferation of T cells can be measured in vitro by 
determining the amount of ^H-labeled thymidine incorporated into the replicating DNA of 
cultured cells. Therefore, the rate of DNA synthesis and, in turn, the rate of cell division 
can be quantified. 

To produce modified proteins or immunogenic fragments by recombinant DNA 
techniques, an expression vector containing a nucleic acid encoding all or a portion of a H. 
pylori protein, operably linked to at least one regulatory sequence can be used. Operably 
linked is intended to mean that the nucleotide sequence is linked to a regulatory sequence in 
a manner which allows expression of the nucleotide sequence. Regulatory sequences are 
art-recognized and include promoters, enhancers and other expression control elements. 
Such regulatory sequences are described in Goeddel, Gene Expression Technology: 
Methods in Enzymology 185, Academic Press, San Diego, CA (1990). It should be 
understood that the design of the expression vector may depend on such factors as the 
choice of the host cell to be transformed and/or the type of protein desired to be expressed. 
In one embodiment of the present invention, the expression vector includes nucleic acid, 
preferably a DNA, encoding a modified H. pylori protein or immunogenic fragment having 
all or a portion of the amino acid sequence . Such expression vectors can be used to 
transfect cells to thereby produce proteins or peptides, including fusion proteins or peptides 
encoded by nucleic acids as described herein. 

Host cells suitable for transfection and recombinant production of//, pylori proteins 
of the invention include any procaryotic or eucaryotic cell. For example, an K pylori 
protein or peptide may be expressed in bacterial cells such as £ coll insect cells 
(baculovirus), yeast, or mammalian cells such as Chinese hamster ovary cells (CHO). 
Other suitable host cells can be found in Goeddel, (1990) supra or known to those skilled in 
the art. 

K pylori proteins and fragments of the invention can also be chemically 
synthesized, using techniques known in the art such as conventional Merrifield solid phase 
f-Moc or t-Boc chemistry. The nucleic acids of the invention can also be chemically 
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5 synthesized using standard techniques. Various methods of chemically synthesizing 
polydeoxynucleotides are known, including solid-phase synthesis which, like peptide 
synthesis, has been fully automated in commercially available DNA synthesizers (see e.g., 
Itakura et al. U.S. Patent No. 4,598,049; Caruthers et_aL, U.S. Patent No. 4,458,066; and 
Itakura, U.S. Patent Nos. 4,401,796 and 4,373,071, incorporated by reference herein). 

10 Vaccine compositions of the present invention containing DNA encoding 

immunogenic protein from H. pylori, or containing modified protein or fragments, contain 
both the DNA or protein and a pharmaceutical ly acceptable carrier. The term 
"pharmaceutically acceptable carrier" refers to a carrier that does not cause an allergic 
reaction or other untoward effect in patients to whom it is administered. Suitable 

1 5 pharmaceutically acceptable carriers include, for example, one or more of water, saline, 
phosphate buffered saline, dextrose, glycerol, ethanol and the like, as well as combinations 
thereof. Pharmaceutically acceptable carriers may further comprise minor amounts of 
auxiliary substances such as wetting or emulsifying agents, preservatives or buffers, which 
enhance the shelf life or effectiveness of the antibody. For vaccines of the invention 

20 containing modified H. pylori protein or immunogenic protein fragments, the protein or 
peptide must be coadministered with a suitable adjuvant. 

It will be apparent to those of skill in the art that the therapeutically effective 
amount of DNA or protein of this invention will depend, inter alia, upon the administration 
schedule., the unit dose of antibody administered, whether the protein or DNA is 

25 administered in combination with other therapeutic agents, the immune status and health of 
the patient, and the therapeutic activity of the particular protein or DNA. 

Vaccine compositions are conventionally administered parenterally, e.g., by 
injection, either subcutaneously or intramuscularly. Methods for intramuscular 
immunization are described by Wolff et al. (1990) Science 247: 1465-1468 and by Sedegah 

30 et al. (1 994) Immunology 91; 9866-9870. Other modes of administration include oral and 
pulmonary formulations, suppositories, and transdermal applications. Oral immunization is 
preferred over parenteral methods for inducing protection against infection by H. pylori. 
Czinn et. al. (1993) Vaccine 11: 637-642. Oral formulations include such normally 
employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, 

35 magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the like. 

The vaccine compositions of the invention can include an adjuvant, including, but 
not limited to aluminum hydroxide; N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr- 
MDP); N-acetyl-nor-muramyl-L*alanyl-D-isoglutamine (CGP 1 1637, referred to as nor- 
MDP); N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-( 1 , -2*-dipalmitoyI-sn- 

40 glycero-3-hydroxyphos-phoryloxy)-ethylamine (CGP 1 9835 A, referred to a MTP-PE); 
RIB1, which contains three components from bacteria; monophosphoryl lipid A; trehalose 
dimycoloate; cell wall skeleton (MPL + TDM + CWS) in a 2% squalene/Tween 80 
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5 emulsion; and cholera toxin. Others which may be used are non-toxic derivatives of 

cholera toxin, including its B subunit, and/or conjugates or genetically engineered fusions 
of the H. pylori polypeptide with cholera toxin or its B subunit, procholeragenoid, fungal 
polysaccharides, including schizophyllan, muramyl dipeptide, muramyl dipeptide 
derivatives, phorbol esters, labile toxin of E. coli, non-//. pylori bacterial lysates. block 

1 0 polymers or saponins. 

Other suitable delivery methods include biodegradable microcapsules or immuno- 
stimulating complexes (ISCOMs), cochleates, or liposomes, genetically engineered 
attenuated live vectors such as viruses or bacteria, and recombinant (chimeric) virus-like 
particles, e.g.,, hluetongue. The amount of adjuvant employed will depend on the type of 

1 5 adjuvant used. For example, when the mucosal adjuvant is cholera toxin, it is suitably used 
in an amount of 5 jig to 50 jig, for example 1 0 |ig to 35 ng. When used in the form of 
microcapsules, the amount used will depend on the amount employed in the matrix of the 
microcapsule to achieve the desired dosage. The determination of this amount is within the 
skill of a person of ordinary skill in the art. 

20 Carrier systems in humans may include enteric release capsules protecting the 

antigen from the acidic environment of the stomach, and including H. pylori polypeptide in 
an insoluble form as fusion proteins. Suitable carriers for the vaccines of the invention are 
enteric coated capsules and polylactide-glycolide microspheres. Suitable diluents are 0.2 N 
NaHC03 and/or saline. 

25 Vaccines of the invention can be administered as a primary prophylactic agent in 

adults or in children, as a secondary prevention, after successful eradication of K pylori in 
an infected host, or as a therapeutic agent with the aim to induce an immune response in a 
susceptible host to prevent infection by H. pylori. The vaccines of the invention are 
administered in amounts readily determined by persons of ordinary skill in the art. Thus, 

30 for adults a suitable dosage will be in the range of 1 0 \xg to 10 g, preferably 1 0 jag to 100 
mg, for example 50 ng to 50 mg. A suitable dosage for adults will also be in the range of 5 
jig to 500 mg. Similar dosage ranges will be applicable for children. Those skilled in the 
art will recognize that the optimal dose may be more or less dependant upon the patient's 
body weight, disease, the route of administration, and other factors. Those skilled in the art 

35 will also recognize that appropriate dosage levels can be obtained based on results with 
known oral vaccines such as, for example, a vaccine based on an E. coli lysate (6 mg dose 
daily up to total of 540 mg) and with an enterotoxigenic E. coli purified antigen (4 doses of 
1 mg) (Schulman et al., 1 Urol 150:917-921 (1993); Boedecker et ^American 
Gastroenterological Assoc. 999:A-222 (1993)). The number of doses will depend upon the 

40 disease, the formulation, and efficacy data from clinical trials. Without intending any 

limitation as to the course of treatment, the treatment can be administered over 3 to 8 doses 
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5 for a primary immunization schedule over 1 month (Boedeker, American 
Gastroenterological Assoc. 888:A-222 (1 993)). 

In a preferred embodiment, a vaccine composition of the invention can be based on 
a killed whole E. coli preparation with an immunogenic fragment of an H. pylori protein of 
the invention expressed on its surface or it can be based on an E. coli lysate, wherein the 

1 0 killed E. coli acts as a carrier or an adjuvant. 

It will be apparent to those skilled in the art that some of the vaccine compositions 
of the invention are useful only for preventing H. pylori infection, some are useful only for 
treating H. pylori infection, and some are useful for both preventing and treating H. pylori 
infection. In a preferred embodiment; the vaccine composition nf the invention provides 

1 5 protection against H. pylori infection by stimulating humoral and/or cell-mediated 
immunity against H. pylori. It should be understood that amelioration of any of the 
symptoms of//, pylori infection is a desirable clinical goal, including a lessening of the 
dosage of medication used to treat H. pylori-caused disease, or an increase in the 
production of antibodies in the serum or mucous of patients. 

20 

Determination of candidate protein antigens for antibody and vaccine development 

The selection of candidate protein antigens for vaccine development were derived 
from the nucleotide sequence. First, all possible open reading frames (ORF's) greater than 
50 nucleotides in all six reading frames were identified and translated into amino acid 
25 sequences. Second, the identified ORF's were analyzed for homology to other known 
exported or membrane proteins and the ORF's were also analyzed using the discriminant 
analysis described by Klein, et al. (Klein, P., Kanehsia, M., and DeLisi, C. (1 985) 
Biochimica et Biophysica Acta 815, 468-476) for predicting exported and membrane 
proteins. 

30 Homology searches were performed using the BLAST algorithm contained in the 

Wisconsin Sequence Analysis Package (Genetics Computer Group, University Research 
Park, 575 Science Drive, Madison, WI 5371 1) to compare each predicted ORF amino acid 
sequence with all sequences found in the current GenBank, SWISS-PROT and PIR 
databases. BLAST searches for local alignments between the ORF and the databank 

35 sequences and reports a probability score which indicates the probability of finding this 
sequence by chance in the database. ORF's with significant homology (e.g. probabilities 
better than lx 1 0 (ee-6)) to membrane or exported proteins represent likely protein antigens 
for vaccine development. Possible functions are provided to some of the H. pylori genes as 
indicated in Figure 560 based on sequence homology to genes cloned in other organisms. 

40 Discriminant analysis (Klein, et al. supra) was used to examine the ORF amino acid 

sequences using our own software. This algorithm uses the intrinsic information contained 
in the ORF amino acid sequence and compares it to information derived from the properties 
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of known membrane and exported proteins. This comparison predicts which proteins will 
be exported, membrane associated or cytoplasmic. ORF amino acid sequences identified as 
exported or membrane associated by this algorithm are likely protein antigens for vaccine 
development. 

Infrequently it is not possible to distinguish between multiple possible nucleotides 
at a given position in the nucleic acid sequence. In those cases the ambiguities are denoted 
by an extended alphabet as follows: 

These are the official IUPAC-IUB single-letter base codes 



Code 


Base Description 




G 


Guanine 




A 


Adenine 




T 


Thymine 




C 


Cytosine 




R 


Purine 


(A orG) 


Y 


Pyrimidine 


(C or T or U) 


M 


Amino 


(A or C) 


K 


Ketone 


(G or T) 


S 


Strong interaction 


(C or G) 


W 


Weak interaction 


(A or T) 


H 


Not-G 


(A or C or T) 


B 


Not-A 


(C or G or T) 


V 


Not-T (not-U) 


(A or C or G) 


D 


Not-C 


(A or G or T) 


N 


Any 


(A or C or G or T) 



The amino acid translations of this invention account for the ambiguity in the 
nucleic acid sequence by translating the ambiguous codon as the letter "X". In all cases, the 
permissible amino acid residues at a position are clear from an examination of the nucleic 
acid sequence based on the standard genetic code. 

Production of Fragments and Analo g s 

The inventor has discovered novel gene products, e.g. bacterial surface gene 
products, from the organism H. pylori. Once an example of this core structure has been 
provided one skilled in the art can alter the disclosed structure (of//, pylori genes), e.g., by 
producing fragments or analogs, and test the newly produced structures for activity. 
Examples of prior art methods which allow the production and testing of fragments and 
analogs are discussed below. These, or analogous methods can be used to make and screen 
libraries of polypeptides, e.g., libraries of random peptides or libraries of fragments or 
analogues of cellular proteins for the ability to bind K pylori polypeptides. Such screens 
are useful for discovery of inhibitors of//, pylori. 
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5 Generation of Fragments 

Fragments of a protein can be produced in several ways, e.g., recombinantly, by 
proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a 
polypeptide can be generated by removing one or more nucleotides from one end (for a 
terminal fragment) or both ends (for an internal fragment) of a nucleic acid which encodes 
1 0 the polypeptide. Expression of the mutagenized DNA produces polypeptide fragments. 
Digestion with "end-nibbling" endonucleases can thus generate DNA's which encode an 
array of fragments. DNA's which encode fragments of a protein can also be generated by 
random shearing, restriction digestion or a combination of the above-discussed methods. 
Fragments can also be chemically synthesized using techniques known in the art 
1 5 such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, 

peptides of the present invention may be arbitrarily divided into fragments of desired length 
with no overlap of the fragments, or divided into overlapping fragments of a desired length. 

Production of Altered DNA and Peptide Sequences: Random Methods 
20 Amino acid sequence variants of a protein can be prepared by random mutagenesis 

of DNA which encodes a protein or a particular domain or region of a protein. Useful 
methods include PCR mutagenesis and saturation mutagenesis. A library of random amino 
acid sequence variants can also be generated by the synthesis of a set of degenerate 
oligonucleotide sequences. (Methods for screening proteins in a library of variants are 
25 elsewhere herein). 

PCR Mutagenesis 

In PCR mutagenesis, reduced Taq polymerase fidelity is used to introduce random 
mutations into a cloned fragment of DNA (Leung et aL, 1989, Technique 1:11-15). This is 

30 a very powerful and relatively rapid method of introducing random mutations. The DNA 
region to be mutagenized is amplified using the polymerase chain reaction (PCR) under 
conditions that reduce the fidelity of DNA synthesis by Taq DNA polymerase, e.g., by 
using a dGTP/dATP ratio of five and adding Mn 2+ to the PCR reaction. The pool of 
amplified DNA fragments are inserted into appropriate cloning vectors to provide random 

35 mutant libraries. 

Saturation Mutagenesis 

Saturation mutagenesis allows for the rapid introduction of a large number of single 
base substitutions into cloned DNA fragments (Mayers et al., 1985, Science 229:242). This 
40 technique includes generation of mutations, e.g., by chemical treatment or irradiation of 
single-stranded DNA in vitro, and synthesis of a complimentary DNA strand. The 
mutation frequency can be modulated by modulating the severity of the treatment, and 
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5 essentially all possible base substitutions can be obtained. Because this procedure does not 
involve a genetic selection for mutant fragments both neutral substitutions, as well as those 
that alter function, are obtained. The distribution of point mutations is not biased toward 
conserved sequence elements. 

10 Degenerate Oligonucleotides 

A library of homologs can also be generated from a set of degenerate 
oligonucleotide sequences. Chemical synthesis of a degenerate sequences can be carried 
out in an automatic DNA synthesizer, and the synthetic genes then ligated into an 
appropriate expression vector. The synthesis of degenerate oligonucleotides is known in 

15 the art (see for example, Narang, S A ( 1 983) Tetrahedron 39:3 ; Itakura et al. ( 1 98 1 ) 
Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, 
Amsterdam: Elsevier pp273-289; Itakura et al. (1984)/* nnu. Rev. Biochem. 53:323; Itakura 
et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 1 1:477. Such techniques 
have been employed in the directed evolution of other proteins (see, for example, Scott et 

20 al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al. 
(1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S. 
Patents Nos. 5,223,409, 5,198,346, and 5,096,815). 

Production of Altered DNA and Peptide Sequences: Methods for Directed Mutagenesis 
25 Non-random or directed, mutagenesis techniques can be used to provide specific 

sequences or mutations in specific regions. These techniques can be used to create variants 
which include, e.g., deletions, insertions, or substitutions, of residues of the known amino 
acid sequence of a protein. The sites for mutation can be modified individually or in series, 
e.g., by (1 ) substituting first with conserved amino acids and then with more radical choices 
30 depending upon results achieved, (2) deleting the target residue, or (3) inserting residues of 
the same or a different class adjacent to the located site, or combinations of options 1-3. 

Alanine Scanning Mutagenesis 

Alanine scanning mutagenesis is a useful method for identification of certain 
35 residues or regions of the desired protein that are preferred locations or domains for 

mutagenesis, Cunningham and Wells (Science 244:1081-1085, 1989). In alanine scanning, 
a residue or group of target residues are identified (e.g., charged residues such as Arg, Asp, 
His, Lys, and Glu) and replaced by a neutral or negatively charged amino acid (most 
preferably alanine or polyalanine). Replacement of an amino acid can affect the interaction 
40 of the amino acids with the surrounding aqueous environment in or outside the cell. Those 
domains demonstrating functional sensitivity to the substitutions are then refined by 
introducing further or other variants at or for the sites of substitution. Thus, while the site 
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5 for introducing an amino acid sequence variation is predetermined, the nature of the 

mutation per se need not be predetermined. For example, to optimize the performance of a 
mutation at a given site, alanine scanning or random mutagenesis may be conducted at the 
target codon or region and the expressed desired protein subunit variants are screened for 
the optimal combination of desired activity. 

10 

Oli onucleotide-Mediated Mutagenesis 

Oligonucleotide-mediated mutagenesis is a useful method for preparing 
substitution, deletion, and insertion variants of DNA, see, e.g., Adelman et al M (DNA 2:183, 
1983). Briefly, the desired DNA is altered by hybridizing an oligonucleotide encoding a 

1 5 mutation to a DNA template, where the template is the single-stranded form of a plasmid or 
bacteriophage containing the unaltered or native DNA sequence of the desired protein. 
After hybridization, a DNA polymerase is used to synthesize an entire second 
complementary strand of the template that will thus incorporate the oligonucleotide primer, 
and will code for the selected alteration in the desired protein DNA. Generally, 

20 oligonucleotides of at least 25 nucleotides in length are used. An optimal oligonucleotide 
will have 12 to 15 nucleotides that are completely complementary to the template on either 
side of the nucleotide(s) coding for the mutation. This ensures that the oligonucleotide will 
hybridize properly to the single-stranded DNA template molecule. The oligonucleotides 
are readily synthesized using techniques known in the art such as that described by Crea et 

25 al. (Proc. Natl Acad. ScL USA, 75: 5765[1978]). 

Cassette Mutagenesis 

Another method for preparing variants, cassette mutagenesis, is based on the 
technique described by Wells et al. (Gene, 34:3 1 5[1985]). The starting material is a 

30 plasmid (or other vector) which includes the protein subunit DNA to be mutated. The 

codon(s) in the protein subunit DNA to be mutated are identified. There must be a unique 
restriction endonuclease site on each side of the identified mutation site(s). If no such 
restriction sites exist, they may be generated using the above-described oligonucleotide- 
mediated mutagenesis method to introduce them at appropriate locations in the desired 

35 protein subunit DNA. After the restriction sites have been introduced into the plasmid, the 
plasmid is cut at these sites to linearize it. A double-stranded oligonucleotide encoding the 
sequence of the DNA between the restriction sites but containing the desired mutation(s) is 
synthesized using standard procedures. The two strands are synthesized separately and then 
hybridized together using standard techniques. This double-stranded oligonucleotide is 

40 referred to as the cassette. This cassette is designed to have 3' and 5* ends that are 

compatible with the ends of the linearized plasmid, such that it can be directly ligated to the 
plasmid. This plasmid now contains the mutated desired protein subunit DNA sequence. 
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Combinatorial Mutagenesis 

Combinatorial mutagenesis can also be used to generate mutants (Ladner et al., WO 
88/06630), In this method, the amino acid sequences for a group of homologs or other 
related proteins are aligned, preferably to promote the highest homology possible. All of 
the amino acids which appear at a given position of the aligned sequences can be selected 
to create a degenerate set of combinatorial sequences. The variegated library of variants is 
generated by combinatorial mutagenesis at the nucleic acid level, and is encoded by a 
variegated gene library. For example, a mixture of synthetic oligonucleotides can be 
enzymatically ligated into gene sequences such that the degenerate set of potential 
sequences are expressible as individual peptides, or alternatively, as a set of larger fusion 
proteins containing the set of degenerate sequences. 

Primary High-Through-Put Methods for Screening Libraries of Peptide Fragments or 
Homologs 

Various techniques are known in the art for screening generated mutant gene 
20 products. Techniques for screening large gene libraries often include cloning the gene 

library into replicable expression vectors, transforming appropriate cells with the resulting 
library of vectors, and expressing the genes under conditions in which detection of a 
desired activity, e.g., in this case, binding to H. pylori polypeptide or an interacting protein, 
facilitates relatively easy isolation of the vector encoding the gene whose product was 
25 detected. Each of the techniques described below is amenable to high through-put analysis 
for screening large numbers of sequences created, e.g., by random mutagenesis techniques. 

Two Hybrid Systems 

Two hybrid assays such as the system described above (as with the other screening 
30 methods described herein), can be used to identify polypeptides, e.g.,fragments or analogs 
of a naturally-occurring H. pylori polypeptide, e.g., of cellular proteins, or of randomly 
generated polypeptides which bind to an H. pylori protein. (The K pylori domain is used 
as the bait protein and the library of variants are expressed as fish fusion proteins.) In an 
analogous fashion, a two hybrid assay (as with the other screening methods described 
35 herein), can be used to find polypeptides which bind a K pylori polypeptide. 

Display Libraries 

In one approach to screening assays, the candidate peptides are displayed on the 
surface of a cell or viral particle, and the ability of particular cells or viral particles to bind 
40 an appropriate receptor protein via the displayed product is detected in a "panning assay". 
For example, the gene library can be cloned into the gene for a surface membrane protein of 
a bacterial cell, and the resulting fusion protein detected by panning (Ladner et al., WO 
88/06630; Fuchs et al. (1991) Bio/Technology 9:1370-1371; and Goward et al. (1992) TIBS 



10 
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5 1 8:136-140). In a similar fashion, a detectably labeled ligand can be used to score for 
potentially functional peptide homologs. Fluorescently labeled ligands, e.g., receptors, can 
be used to detect homologs which retain ligand-binding activity. The use of fluorescently 
labeled ligands, allows cells to be visually inspected and separated under a fluorescence 
microscope, or, where the morphology of the cell permits, to be separated by a 
10 fluorescence-activated cell sorter. 

A gene library can be expressed as a fusion protein on the surface of a viral particle. 
For instance, in the filamentous phage system, foreign peptide sequences can be expressed 
on the surface of infectious phage, thereby conferring two significant benefits. First, since 
these phage can be applied to affinity matrices at concentrations well over 10 13 phage per 
15 milliliter, a large number of phage can be screened at one time. Second, since each 

infectious phage displays a gene product on its surface, if a particular phage is recovered 
from an affinity matrix in low yield, the phage can be amplified by another round of 
infection. The group of almost identical E. coli filamentous phages Ml 3, fd., and fl are 
most often used in phage display libraries. Either of the phage gill or gVIII coat proteins 
20 can be used to generate fusion proteins without disrupting the ultimate packaging of the 
viral particle. Foreign epitopes can be expressed at the NH 2 -terminal end of pill and phage 
bearing such epitopes recovered from a large excess of phage lacking this epitope (Ladner 
et al. PCT publication WO 90/02909; Garrard et al.. PCT publication WO 92/09690; Marks 
et al. (\992) J. Biol. Chem. 267:16007-16010; Griffiths et al. (1993) EMBOJ 12:725-734; 
25 Clackson et al. (1 991 ) Nature 352:624-628; and Barbas et al. (1992) PNAS 89:4457-4461 ). 
A common approach uses the maltose receptor of £ coli (the outer membrane 
protein, LamB) as a peptide fusion partner (Charbit et al. (1986) EMBO 5, 3029-3037). 
Oligonucleotides have been inserted into plasmids encoding the LamB gene to produce 
peptides fused into one of the extracellular loops of the protein. These peptides are 
30 available for binding to ligands, e.g., to antibodies, and can elicit an immune response when 
the cells are administered to animals. Other cell surface proteins, e.g., OmpA (Schorr et al. 
(1991) Vaccines 91, pp. 387-392), PhoE (Agterberg, et al. (1990) Gene 88, 37-45), and 
PAL (Fuchs et al. (1991) Bio/Tech 9, 1369-1372), as well as large bacterial surface 
structures have served as vehicles for peptide display. Peptides can be fused to pilin, a 
35 protein which polymerizes to form the pilus-a conduit for interbacterial exchange of genetic 
information (Thiry et al. (\9%9) Appl. Environ. Microbiol. 55, 984-993). Because of its 
role in interacting with other cells, the pilus provides a useful support for the presentation 
of peptides to the extracellular environment. Another large surface structure used for 
peptide display is the bacterial motile organ, the flagellum. Fusion of peptides to the 
40 subunit protein flagellin offers a dense array of many peptide copies on the host cells 
(Kuwajima et al. (1988) Bio/Tech. 6, 1080-1 083). Surface proteins of other bacterial 
species have also served as peptide fusion partners. Examples include the Staphylococcus 
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protein A and the outer membrane protease IgA of Neisseria (Hansson et al. (1992) J. 
Bacteriol. 174, 4239-4245 and Klauser et al. (1990) EMBOJ. 9, 1991-1999). 

In the filamentous phage systems and the LamB system described above, the 
physical link between the peptide and its encoding DNA occurs by the containment of the 
DNA within a particle (cell or phage) that carries the peptide on its surface. Capturing the 
peptide captures the particle and the DNA within. An alternative scheme uses the DNA- 
binding protein Lad to form a link between peptide and DNA (Cull et al. (1 992) PNAS 
USA 89:1865-1 869). This system uses a plasmid containing the Lad gene with an 
oligonucleotide cloning site at its 3'-end. Under the controlled induction by arabinose, a 
Lacl-peptide fusion protein is produced. This fusion retains the natural ability of Lad to 
bind to a short DNA sequence known as LacO operator (LacO). By installing two copies 
of LacO on the expression plasmid, the Lacl-peptide fusion binds tightly to the plasmid that 
encoded it. Because the plasmids in each cell contain only a single oligonucleotide 
sequence and each cell expresses only a single peptide sequence, the peptides become 
specifically and stably associated with the DNA sequence that directed its synthesis. The 
cells of the library are gently lysed and the peptide-DNA complexes are exposed to a matrix 
of immobilized receptor to recover the complexes containing active peptides. The 
associated plasmid DNA is then reintroduced into cells for amplification and DNA 
sequencing to determine the identity of the peptide ligands. As a demonstration of the 
practical utility of the method, a large random library of dodecapeptides was made and 
selected on a monoclonal antibody raised against the opioid peptide dynorphin B. A cohort 
of peptides was recovered, all related by a consensus sequence corresponding to a six- 
residue portion of dynorphin B. (Cull et al. (1992) Proc. Natl. Acad. Sci. U.S.A. 89-1869) 

This scheme, sometimes referred to as peptides-on-plasmids, differs in two 
important ways from the phage display methods. First, the peptides are attached to the C- 
terminus of the fusion protein, resulting in the display of the library members as peptides 
having free carboxy termini. Both of the filamentous phage coat proteins, pill and pVIII, 
are anchored to the phage through their C-termini, and the guest peptides are placed into the 
outward-extending N-terminal domains. In some designs, the phage-displayed peptides are 
presented right at the amino terminus of the fusion protein. (Cwirla. et al. (1990) Proc. 
Natl. Acad. Sci. U.S.A. 87, 6378-6382) A second difference is the set of biological biases 
affecting the population of peptides actually present in the libraries. The Lad fusion 
molecules are confined to the cytoplasm of the host cells. The phage coat fusions are 
exposed briefly to the cytoplasm during translation but are rapidly secreted through the 
inner membrane into the periplasmic compartment, remaining anchored in the membrane 
by their C-terminal hydrophobic domains, with the N-termini, containing the peptides, 
protruding into the periplasm while awaiting assembly into phage particles. The peptides 
in the Lacl and phage libraries may differ significantly as a result of their exposure to 
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5 different proteolytic activities. The phage coat proteins require transport across the inner 
membrane and signal peptidase processing as a prelude to incorporation into phage. 
Certain peptides exert a deleterious effect on these processes and are underrepresented in 
the libraries (Gallop et al. ( 1 994) J. Med. Chem. 37(9): 1 233- 1 25 1 ). These particular biases 
are not a factor in the Lad display system. 

10 The number of small peptides available in recombinant random libraries is 

enormous. Libraries of 10 7 -10 9 independent clones are routinely prepared. Libraries as 
large as 1 0 1 1 recombinants have been created, but this size approaches the practical limit 
for clone libraries. This limitation in library size occurs at the step of transforming the 
DNA containing randomized segments into the host bacterial cells. To circumvent this 

1 5 limitation, an in vitro system based on the display of nascent peptides in polysome 

complexes has recently been developed. This display library method has the potential of 
producing libraries 3-6 orders of magnitude larger than the currently available 
phage/phagemid or plasmid libraries. Furthermore, the construction of the libraries, 
expression of the peptides, and screening, is done in an entirely cell-free format. 

20 In one application of this method (Gallop et al. (1994) J. Med Chem. 37(9):1233- 

1251), a molecular DNA library encoding 10 12 decapeptides was constructed and the 
library expressed in an E. coli S30 in vitro coupled transcription/translation system. 
Conditions were chosen to stall the ribosomes on the mRNA, causing the accumulation of a 
substantial proportion of the RNA in polysomes and yielding complexes containing nascent 

25 peptides still linked to their encoding RNA. The polysomes are sufficiently robust to be 
affinity purified on immobilized receptors in much the same way as the more conventional 
recombinant peptide display libraries are screened. RNA from the bound complexes is 
recovered, converted to cDNA, and amplified by PCR to produce a template for the next 
round of synthesis and screening. The polysome display method can be coupled to the 

30 phage display system. Following several rounds of screening, cDNA from the enriched 
pool of polysomes was cloned into a phagemid vector. This vector serves as both a peptide 
expression vector, displaying peptides fused to the coat proteins, and as a DNA sequencing 
vector for peptide identification. By expressing the polysome-derived peptides on phage, 
one can either continue the affinity selection procedure in this format or assay the peptides 

35 on individual clones for binding activity in a phage ELISA, or for binding specificity in a 
completion phage ELISA (Barret, et al. (\992) Anal. Biochem 204,357-364). To identify 
the sequences of the active peptides one sequences the DNA produced by the phagemid 
host. 



40 



Secondary Screens 

The high through-put assays described above can be followed by secondary screens 
in order to identify further biological activities which will, e.g., allow one skilled in the art 
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5 to differentiate agonists from antagonists. The type of a secondary screen used will depend 
on the desired activity that needs to be tested. For example, an assay can be developed in 
which the ability to inhibit an interaction between a protein of interest and its respective 
ligand can be used to identify antagonists from a group of peptide fragments isolated 
though one of the primary screens described above. 
1 0 Therefore, methods for generating fragments and analogs and testing them for 

activity are known in the art. Once the core sequence of interest is identified, it is routine to 
perform for one skilled in the art to obtain analogs and fragments. 

Peptide Mimetics 

1 5 The invention also provides for reduction of the protein binding domains of the 

subject H. pylori-like family polypeptides, e.g., an H. pylori polypeptide, to generate 
mimetics, e.g. peptide or non-peptide agents. The peptide mimetics are able to disrupt 
binding of a K pylori to its counter ligand, e.g., in the case of an H. pylori polypeptide 
binding to a naturally occurring ligand. The critical residues of a subject K pylori 

20 polypeptide which are involved in molecular recognition of a polypeptide can be 

determined and used to generate K /Ty/on-derived peptidomimetics which competitively or 
noncompetatively inhibit binding of the H. pylori polypeptide with an interacting 
polypeptide (see, for example, "Peptide inhibitors of human papillomavirus protein binding 
to retinoblastoma gene protein" European patent applications EP-4 12,762 A and EP- 

25 B3 1 ,080A). For example, scanning mutagenesis can be used to map the amino acid 
residues of a particular H. pylori polypeptide involved in binding an interacting 
polypeptide, peptidomimetic compounds (e.g. diazepine or isoquinoline derivatives) can be 
generated which mimic those residues in binding to an interacting polypeptide, and which 
therefore can inhibit binding of an H. pylori polypeptide to an interacting polypeptide and 

30 thereby interfere with the function of H. pylori polypeptide. For instance, non- 

hydrolyzable peptide analogs of such residues can be generated using benzodiazepine (e.g., 
see Freidinger et al. in Peptides: Chemistry and Biology^ G.R. Marshall ed., ESCOM 
Publisher: Leiden, Netherlands, 1988), azepine (e.g., see Huffman et al. in Peptides: 
Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), 

35 substituted gama lactam rings (Garvey et al. in Peptides: Chemistry and Biology, G.R. 
Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), keto-methylene 
pseudopeptides (Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. in 
Peptides: Structure and Function (Proceedings of the 9th American Peptide Symposium) 
Pierce Chemical Co. Rockland, IL, 1985), p-turn dipeptide cores (Nagai et al. (1985) 

40 Tetrahedron Lett 26:647; and Sato et al. (1986) J Chem Soc Perkin Trans 1 : 123 1), and 0- 
aminoalcohols (Gordon et al. (1985) Biochem Biophys Res Communl26:4\9; and Dann et 
al. (1986) Biochem Biophys Res Commun 134:71). 
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5 Kits 

The nucleic acid, peptides and antibodies of the present invention can be combined 
with other reagents and articles to form kits. Kits for diagnostic purposes typically 
comprise the nucleic acid, peptides or antibodies in vials or other suitable vessels. Kits 
typically comprise other reagents for performing hybridization reactions, polymerase chain 

1 0 reactions (PCR), or for reconstitution of lyophilized components, such as aqueous media, 
salts, buffers, and the like. Kits may also comprise reagents for sample processing such as 
detergents, chaotropic salts and the like. Kits may also comprise immobilization means 
such as particles, supports, wells, dipsticks and the like. Kits may also comprise labeling 
means such as dyes, developing reagents, radioisotopes, fluorescent agents, luminescent or 

1 5 chemiluminescent agents, enzymes, intercalating agents and the like. With the nucleic acid 
and amino acid sequence information provided herein, individuals skilled in art can readily 
assemble kits to serve their particular purpose. 

Exemplification 

20 I. Cloning and Sequencing of H. py lori DNA 

H. pylori chromosomal DNA was isolated according to a basic DNA protocol 
outlined in Schleif R.F. and Wensink P.C M Practical Methods in Molecular Biology, p.98, 
Springer-Verlag, NY., 1981 , with minor modifications. Briefly, cells were pelleted, 
resuspended in TE (10 mM Tris, 1 mM EDTA, pH 7.6) and GES lysis buffer (5.1 M 

25 guanidium thiocyanate, 0. 1 M EDTA, pH 8.0, 0.5% N-laurylsarcosine) was added. 

Suspension was chilled and ammonium acetate (NH 4 Ac) was added to final concentration 
of 2.0 M. DNA was extracted, first with chloroform, then with phenol-chloroform, and 
reextracted with chloroform. DNA was precipitated with isopropanol, washed twice with 
70% EtOH, dried and resuspended in TE. 

30 Following isolation whole genomic K pylori DNA was nebulized (Bodenteich et 

al., Automated DNA Sequencing and Analysis (J.C. Venter, ed.), Academic Press, 1994) to 
a median size of 2000 bp. After nebulization, the DNA was concentrated and separated on 
a standard 1% agarose gel. Several fractions, corresponding to approximate sizes 900-1300 
bp, 1300-1 700 bp, 1 700-2200 bp, 2200-2700 bp, were excised from the gel and purified by 

35 the GeneClean procedure (BiolOl , Inc.). 

The purified DNA fragments were then blunt-ended using T4 DNA polymerase. 
The healed DNA was then ligated to unique BstXI-linkcr adapters in 100-1000 fold molar 
excess. These linkers are complimentary to the BstXl-cut pMPX vectors, while the 
overhang is not self-complimentary. Therefore, the linkers will not concatemerize nor will 

40 the cut-vector religate itself easily. The linker-adopted inserts were separated from the 
unincorporated linkers on a 1% agarose gel and purified using GeneClean. The linker- 
adapted inserts were then ligated to each of the 20 pMPX vectors to construct a series of 
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"shotgun" subclone libraries. The vectors contain an out-of-frame lacZ gene at the cloning 
site which becomes in-frame in the event that an adapter-dimer is cloned, allowing these to 
be avoided by their blue-color. 

All subsequent steps were based on the multiplex DNA sequencing protocols 
outlined in Church G.M. and Kieffer-Higgins S., Science 240:185-1 88, 1 988. Only major 
modifications to the protocols are highlighted. Briefly, each of the 20 vectors was then 
transformed into DH5ct competent cells (Gibco/BRL, DH5a transformation protocol). The 
libraries were assessed by plating onto antibiotic plates containing ampicillin, methicillin 
and IPTG/Xgal. The plates were incubated overnight at 37<>C. Successful transformants 
were then used for plating of clones and pooling into the multiplex pools. The clones were 
picked and pooled into 40 ml growth medium cultures. The cultures were grown overnight 
at 370C. DNA was purified using the Qiagen Midi-prep kits and Tip-100 columns 
(Qiagen. Inc.). In this manner, 1 00 ug of DNA was obtained per pool. 1 5 96-weIl plates of 
DNA were generated to obtain a 5-10 fold sequence redundancy with 250-300 base average 
read-lengths. 

These purified DNA samples were then sequenced using the multiplex DNA 
sequencing based on chemical degradation methods (Church G.M. and Kieffer-Higgins S., 
Science 240: 1 85-188, 1 988) or by Sequithrem (Epicenter Technologies) dideoxy 
sequencing protocols. The sequencing reactions were electrophoresed and transferred onto 
nylon membranes by direct transfer electrophoresis from 40 cm gels (Richterich P. and 
Church G.M., Methods in Enzemology 21 8: 1 87-222, 1 993) or by electroblotting (Church, 
supra). 24 samples were run per gel. 45 successful membranes were produced by chemical 
sequencing and 8 were produced by dideoxy sequencing. The DNA was covalently bound 
to the membranes by exposure to ultraviolet light, and hybridized with labeled 
oligonucleotides complimentary to tag sequences on the vectors (Church, supra). The 
membranes were washed to rinse off non-specifically bound probe, and exposed to X-ray 
film to visualize individual sequence ladders. After autoradiography, the hybridized prob, 
was removed by incubation at 650C, and the hybridization cycle repeated with another tag 
sequence until the membrane has been probed 38 times for chemical sequencing 
membranes and 10 times for the dideoxy sequencing membranes. Thus, each gel produced 
a large number of films, each containing new sequencing information. Whenever a new 
blot was processed, it was initially probed for an internal standard sequence added to each 
of the pools. 

Digital images of the films were generated using a laser-scanning densitometer 
(Molecular Dynamics, Sunnyvale, CA). The digitized images were processed on computer 
workstations (VaxStation 4000*s) using the program REPLICA™ (Church et al.. 
Automated DNA Sequenicng and Analysis (J.C. Venter, ed.), Academic Press, 1994). 
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5 Image processing included lane straightening, contrast adjustment to smooth out intensity 
differences, and resolution enhancement by iterative gaussian deconvolution. The 
sequences were then automatically picked in REPLICA™ and displayed for interactive 
proofreading before being stored in a project database. The proofreading was accomplished 
by a quick visual scan of the film image followed by mouse clicks on the bands of the 

1 0 displayed image to modify the base calls. For typical sequences derived by chemical 
sequencing, the error rate of the REPLICA™ base calling software was 2-5% with most 
errors occurring near the end of a sequence read. Many of the sequence errors could be 
detected and corrected because multiple sequence reads covering the same portion of the 
genomic DNA provide adequate sequence redundancy for editing. Each sequence 

1 5 automatically received a number correspond to (microtiter plate and probe information) and 
lane set number (corresponding to microtiter plate columns). This number serves as a 
permanent identifier of the sequence so it is always possible to identify the original of any 
particular sequence without recourse to a specialized database. 

Routine assembly of//, pylori sequences was done using the program FALCON 

20 (Church, Church et al., Automated DNA Sequenicng and Analysis (J.C. Venter, ed.), 
Academic Press, 1994). This program has proven to be fast and reliable for most 
sequences. The assembled contigs were displayed using a modified version of 
GelAssemble, developed by the Genetics Computer Group (GCG) (Devereux et al., Nucleic 
Acid Res. 12:387-95, 1984) that interacts with REPLICA™. This provided for an 

25 integrated editor that allows multiple sequence gel images to be instantaneously called up 
from the REPLICA™ database and displayed to allow rapid scanning of contigs and 
proofreading of gel traces where discrepancies occurred between different sequence reads 
in the assembly. 

30 II. Identification, cloning and expression of recom binant H. pylori DNA sequences 

To facilitate the cloning, expression and purification of membrane and secreted 
proteins from H. pylori a powerful gene expression system, the pET System (Novagen), for 
cloning and expression of recombinant proteins in E. coli, was selected. Also, a DNA 
sequence encoding a peptide tag, the His-Tag, was fused to the 3' end of DNA sequences of 

35 interest in order to facilitate purification of the recombinant protein products. The 3' end 
was selected for fusion in order to avoid alteration of any 5' terminal signal sequence. The 
exception to the above was ppiB, a gene cloned for use as a control in the expression 
studies. In this study, the sequence forK pylori ppiB contains a DNA sequence encoding a 
His-Tag fused to the 5' end of the full length gene, because the protein product of this gene 

40 does not contain a signal sequence and is expressed as a cytosolic protein. 
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PCR Amplification and cloning o/DNA sequences containing ORFsfor membrane and 
secreted proteins from the J99 Strain of Helicobacter pylori. 

Sequences chosen (from the list of the DNA sequences of the invention) for cloning 
from the J99 strain of H. pylori were prepared for amplification cloning by polymerase 
chain reaction (PCR). Synthetic oligonucleotide primers (Table 2) specific for the 5' and 3' 
ends of open reading frames (ORFs) were designed and purchased (GibcoBRL Life 
Technologies, Gaithersburg, MD, USA). All forward primers (specific for the 5' end of the 
sequence) were designed to include an Ncol cloning site at the extreme 5' terminus, except 
for H. pylori sequence 4821082 (SEQ ID NO: 212) where NdeJ was used. These primers 
were designed to permit initiation of protein translation at a methionine residue followed by 
a valine residue and the coding sequence for the remainder of the native H. pylori DNA 
sequence. An exception is H. pylori sequence 482 1 082 (SEQ ID NO: 212) where the 
initiator methionine is immediately followed by the remainder of the native K pylori DNA 
sequence. All reverse primers (specific for the 3' end of any H. pylori ORF) included a 
EcpRI site at the extreme 5' terminus to permit cloning of each H. pylori sequence into the 
reading frame of the pET-28b. The pET-28b vector provides sequence encoding an 
additional 20 carboxy-terminal amino acids including six histidine residues (at the extreme 
C-terminus), which comprise the His-Tag. An exception to the above, as noted earlier, is 
the vector construction for the ppiB gene. A synthetic oligonucleotide primer specific for 
the 5' end of ppiB gene encoded a BamHI site at its extreme 5' terminus and the primer for 
the 3' end of the ppiB gene encoded a Xhol site at its extreme 5' terminus. 



TABLE 2 

Oligonucleotide primers used for PCR amnlifiratinn n f h nxt a 


Outer membrane Proteins 


Forward primer 5' to 3' 


Reverse Primer 5' to 3' 


7116626 (SEQ ID NO: 223) 


5-ATATCCATGGTGAGT 
TTGATGA-3' (SEQ ID 
NO:229) 


5'-ATGAATTCAATTTTT 
TATTTTGCCA-3' (SEQ ID 
NO:230) 


29479681 (SEQ ID NO: 179) 


5'-AATTCCATGGTGGGG 
GCTATG-3' (SEQ ID 
NO:231) 


5'-ATGAATTCTCGATAG 
CCAAAATC-3' (SEQ ID 
NO:232) 


Periplasmic/ Secreted 
Proteins 






30100332 (SEQ ID NO: 181) 


5'-ATTTCCATGGTCATG 
TCTCATATT-3' (SEQ ID 
NO:233) 


5'-ATGAATTCCATCTTT 
TATTCCAC-3' (SEQ ID 
NO:234) 


4721061 (SEQ ID NO: 211) 


S'-AACCATGGTGATTT 
TAAGCATTGAAAG-3' 
(SEQ ID NO:235) 


5'-AAGAATTCCACTCA 
A AATTTTTTA ACAG-3 ' 
(SEQIDN0.236) 
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Other Surface Proteins 






4821082 (SEQ ID NO: 212) 


5'-GATCATCCATATGTT 
ATCTTCTAAT-3' (SEQ ID 
NO:237) 


5'-TG AATTC A ACCA HI 
TAACCCTG-3' (SEQ ID 
NO:238) 


978477 (SEQ ID NO: 228) 


5'-TATACCATGGTGAA 
A I TI'I'l rCTTTTA-3' (SEQ 
IDNO:239) 


5'-AGAATTCAATTGCG 
TCTTGTAAAAG-3' (SEQ ID 
NO:240) 


Cytoplasmic Protein 






ppiB 


5'-TTATGGATCCAAAC 
CAATTAAAACT-V (SEQ ID 
NO:241) 


S'-TATCTCGAGTTATA 
GAGAAGGGC-V (SEQ ID 
NO:242) 



Genomic DNA prepared from the J99 strain of//, pylori (ATCC #55679) was used 
as the source of template DNA for PCR amplification reactions (Current Protocols in 
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). To amplify a 

1 0 DNA sequence containing an H. pylori ORF, genomic DNA (50 nanograms) was 
introduced into a reaction vial containing 2 mM MgCl2, 1 micromolar synthetic 
oligonucleotide primers (forward and reverse primers) complementary to and flanking a 
defined H. pylori ORF, 0.2 mM of each deoxynucleotide triphosphate; dATP, dGTP, 
dCTP, dTTP and 2.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular 

1 5 Systems, Inc., Branchburg, NJ, USA) in a final volume of 1 00 microliters. The following 
thermal cycling conditions were used to obtain amplified DNA products for each ORF 
using a Perkin Elmer Cetus/ GeneAmp PCR System 9600 thermal cycler: 

Sequences 7116626 (SEQ ID NO: 223), 29479681 (SEQ ID NO: 179), 30100332 
20 (SEQ ID NO: 181), 4821082 (SEQ ID NO: 212) and 978477 (SEQ ID NO: 228); 
Denaturation at 94°C for 2 min, 

2 cycles at 94°C for 1 5 sec, 30°C for 1 5 sec and 72°C for 1 .5 min 
23 cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min 
Reactions were concluded at 72°C for 6 minutes. 

25 

Sequence 4721061 (SEQ ID NO: 21 1); 
Denaturation at 94°C for 2 min, 

2 cycles at 94°C for 15 sec, 36°C for 15 sec and 72°C for 1.5 min 
23 cycles at 94°C for 15 sec, 60°C for 15 sec and 72°C for 1.5 min 
30 Reactions were concluded at 72°C for 6 minutes. 

Conditions for amplification of H. pylori ppiB; 

Denaturation at 94°C for 2 min, 

2 cycles at 94°C for 15 sec, 32°C for 15 sec and 72°C for 1.5 min 
35 25 cycles at 94°C for 1 5 sec, 56°C for 15 sec and 72°C for 1 .5 min 
Reactions were concluded at 72°C for 6 minutes 
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5 Upon completion of thermal cycling reactions, each sample of amplified DNA was 

washed and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, 
MD. USA). All amplified DNA samples were subjected to digestion with the restriction 
endonucleases, Ncol and EcoRI (New England BioLabs, Beverly, MA, USA), or in the 
case of H. pylori sequence 482 1 082 (SEQ ID NO: 2 1 2), with Ndel and EcoRI (Current 

10 Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). 
DNA samples were then subjected to electrophoresis on 1.0 % NuSeive (FMC 
BioProducts, Rockland, ME USA) agarose gels. DNA was visualized by exposure to 
ethidium bromide and long wave uv irradiation. DNA contained in slices isolated from the 
agarose gel was purified using the Bio 10! GencCIcan Kit protocol (Bio 101 Vista, CA, 

15 USA). 

Cloning ofH. pylori DNA sequences into the pET-28b prokaryotic expression vector. 

The pET-28b vector was prepared for cloning by digestion with Ncol and EcoRI, or 
in the case of H. pylori sequence 482 1 082 (SEQ ID NO: 2 1 2) with Ndel and EcoRI 

20 (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 
1994). In the case of cloning ppiB, the pET-28a vector, which encodes a His-Tag that can 
be fused to the 5' end of an inserted gene, was used and the cloning site prepared for 
cloning with the ppiB gene by digestion with BamHl and Xhol restriction endonucleases. 
Following digestion, DNA inserts were cloned (Current Protocols in Molecular 

25 Biology, John Wiley and Sons, Inc., F. Ausubel et al, eds., 1994) into the previously 
digested pET-28b expression vector, except for the amplified insert for ppiB. which was 
cloned into the pET-28a expression vector. Products of the ligation reaction were then used 
to transform the BL21 strain of £. coli (Current Protocols in Molecular Biology, John 
Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) as described below. 

30 

Transformation of competent bacteria with recombinant plasmids 

Competent bacteria, E coli strain BL2 1 or E. coli strain BL2 1 (DE3), were 
transformed with recombinant pET expression plasmids carrying the cloned H. pylori 
sequences according to standard methods (Current Protocols in Molecular, John Wiley and 

35 Sons, Inc., F. Ausubel et al, eds., 1994). Briefly, 1 microliter of ligation reaction was 
mixed with 50 microliters of electrocompetent cells and subjected to a high voltage pulse, 
after which, samples were incubated in 0.45 milliliters SOC medium (0.5% yeast extract, 
2.0 % tryptone, 1 0 mM NaCl, 2.5 mM KC1, 1 0 mM MgC12, 1 0 mM MgS04 and 20, mM 
glucose) at 37°C with shaking for 1 hour. Samples were then spread on LB agar plates 

40 containing 25 microgram/ml kanamycin sulfate for growth overnight. Transformed 

colonies of BL21 were then picked and analyzed to evaluate cloned inserts as described 
below. 
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Identification of recombinant pET expression plasmids carrying H. pylori sequences 

Individual BL21 clones transformed with recombinant pET-28b-H.pylori ORFs 
were analyzed by PCR amplification of the cloned inserts using the same forward and 
reverse primers, specific for each H. pylori sequence, that were used in the original PCR 
amplification cloning reactions. Successful amplification verified the integration of the H. 
pylori sequences in the expression vector (Current Protocols in Molecular Biology, John 
Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). 

Isolation and Preparation of plasmidDNA from BL21 transformants 

Individual clones of recombinant pET-28b vectors carrying properly cloned II. 
pylori ORFs were picked and incubated in 5 mis of LB broth plus 25 microgram/ml 
kanamycin sulfate overnight. The following day plasmid DNA was isolated and purified 
using the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, CA, USA). 

Expression of recombinant H. pylori sequences in E. coli 

The pET vector can be propagated in any E coli K-12 strain e.g. HMS174, HB101, 
JM109, DH5, etc. for the purpose of cloning or plasmid preparation. Hosts for expression 
include E coli strains containing a chromosomal copy of the gene for T7 RNA polymerase. 
These hosts are lysogens of bacteriophage DE3, a lambda derivative that carries the lad 
gene, the lacUV5 promoter and the gene for T7 RNA polymerase. T7 RNA polymerase is 
induced by addition of isopropyl-B-D-thiogaiactoside (IPTG), and the T7 RNA polymerase 
transcribes any target plasmid, such as pET-28b, carrying a T7 promoter and a gene of 
interest. Strains used include: BL21(DE3) (Studier, F.W., Rosenberg, A.H., Dunn, J.J., and 
Dubendorff, J.W. (1990) Meth. Enzymol. 185, 60-89). 

To express recombinant K pylori sequences, 50 nanograms of plasmid DNA 
isolated as described above was used to transform competent BL21(DE3) bacteria as 
described above (provided by Novagen as part of the pET expression system kit). The lacZ 
gene (beta-galactosidase) was expressed in the pET-System as described for the H. pylori 
recombinant constructions. Transformed cells were cultured in SOC medium for 1 hour, 
and the culture was then plated on LB plates containing 25 micrograms/ml kanamycin 
sulfate. The following day, bacterial colonies were pooled and grown in LB medium 
containing kanamycin sulfate (25 micrograms/ml) to an optical density at 600 nM of 0.5 to 
1 .0 O.D. units, at which point, 1 millimolar IPTG was added to the culture for 3 hours to 
induce gene expression of the H. pylori recombinant DNA constructions . 

After induction of gene expression with IPTG, bacteria were pelleted by 
centrifugation in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4°C. Pellets 
were resuspended in 50 milliliters of cold 10 mM Tris-HCl, pH 8.0, 0. 1 M NaCl and 0. 1 
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5 mM EDTA (STE buffer). Cells were then centrifuged at 2000 x g for 20 min at 4°C Wet 
pellets were weighed and frozen at -80°C until ready for protein purification. 

III. Purification of recombinant proteins from £ coli 
Analytical Methods 

1 o The concentrations of purified protein preparations were quantified 

spectrophotometrically using absorbance coefficients calculated from amino acid content 
(Perkins, S.J. 1986 Eur. J. Biochem. 157, 169-180). Protein concentrations were also 
measured by the method of Bradford, M.M. (1976) Anal. Biochem. 72, 248-254, and 
Lowry, O.H., Roscbrough, N„ Farr, A.L. & Randall, R.J. (1951) J. Biol. Chcm. 193, pages 

1 5 265-275, using bovine serum albumin as a standard. 

SDS-polyacrylamide gels (12% or 4.0 to 25 % acrylamide gradient gels) were 
purchased from BioRad (Hercules, CA, USA), and stained with Coomassie blue. 
Molecular weight markers included rabbit skeletal muscle myosin (200 kDa), £ coli (- 
galactosidase (116 kDa), rabbit muscle phosphorylase B (97.4 kDa), bovine serum albumin 

20 (66.2 kDa), ovalbumin (45 kDa), bovine carbonic anhydrase (3 1 kDa), soybean trypsin 
inhibitor (21 .5 kDa), egg white lysozyme (14.4 kDa) and bovine aprotinin (6.5 kDa). 



L Purification of soluble proteins 

All steps were carried out at 4°C. Frozen cells were thawed, resuspended in 5 

25 volumes of lysis buffer (20 mM Tris, pH 7.9, 0.5 M NaCl, 5 mM imidazole with 10% 
glycerol, 0.1 % 2-mercaptoethanol, 200 jig/ ml lysozyme, 1 mM phenylmethylsulfonyl 
fluoride (PMSF), and 10 ug/ml each of leupeptin, aprotinin, pepstatin, L-l-chloro-3-[4- 
tosylamido]-7-amino-2-heptanone (TLCK), L-l -chloro-3-[4-tosylamido]-4-phenyl-2- 
butanone (TPCK), and soybean trypsin inhibitor, and ruptured by several passages through 

30 a small volume microfluidizer (Model M-l 1 OS, Microfluidics International Corporation, 
Newton, MA). The resultant homogenate was made 0.1 % Brij 35, and centrifuged at 
100,000 x g for 1 hour to yield a clear supernatant (crude extract). 

Following filtration through a 0.8 ^m Supor filter (Gelman Sciences, FRG) the 
crude extract was loaded directly onto a Ni 2+ " nitrilotriacetate-agarose (NTA) with a 5 

35 milliliter bed volume (Hochuli, E., Dbeli, H., and Schacheer, A. (1987) J. Chromatography 
41 1, 177-184) pre-equilibrated in lysis buffer containing 10 % glycerol, 0.1 % Brij 35 and 1 
mM PMSF. The column was washed with 250 ml (50 bed volumes) of lysis buffer 
containing 10 % glycerol, 0.1 % Brij 35, and was eluted with sequential steps of lysis 
buffer containing 10 % glycerol, 0.05 % Brij 35, 1 mM PMSF, and 20, 100, 200, and 500 

40 mM imidazole in succession. Fractions were monitored by absorbance at OD280 nm, and 
peak fractions were analyzed by SDS-PAGE. Fractions containing the recombinant protein 
eluted at 100 mM imidazole. 
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5 Recombinant beta-galactosidase (lacZ) and peptidyl-prolyl cis-trans isomerase (ppiB) 

Fractions containing the recombinant proteins from the Nj2 + -NTA-agarose columns 
were pooled and then concentrated to approximately 5 ml by centrifugal filtration 
(Centriprep-10, Amicon, MA), and loaded directly onto a 180-ml column (1.6 X 91 cm) of 
Sephacryl S-100 HR gel filtration medium equilibrated in Buffer A (10 mM Hepes, pH 7.5, 
10 150 mM NaCl, 0. 1 mM EGTA) and run in Buffer A at 1 8 ml/h. Fractions containing the 
recombinant protein were identified by absorbance at 280 nm and analyzed by SDS-PAGE. 
Fractions were pooled and concentrated by centrifugal filtration. 

Recombinant protein 7116626 (SEQ ID NO: 223) 

1 5 Fractions containing the recombinant protein from the Ni 2+ -NTA-agarose column 

were pooled and dialyzed overnight against I liter of dialysis buffer (10 mM MOPS, pH 
6.5, 50 mM NaCl, 0.1 mM EGTA, 0.02% Brij 35 and 1 mM PMSF). In the morning, a fine 
white precipitate was removed by centrifugation and the resulting supernatant was loaded 
onto an 8 ml (8 x 75 mm) MonoS high performance liquid chromatography column 

20 (Pharmacia Biotechnology, Inc., Piscataway, NJ, USA) equilibrated in buffer B (10 mM 

MOPS, pH 6.5, 0.1 mM EGTA) containing 50 mM NaCl. The column was washed with 10 
bed volumes of buffer B containing 50 mM NaCl, and developed with a 50-ml linear 
gradient of increasing NaCl (50 to 500 mM). Recombinant protein 71 16626 (SEQ ID NO: 
223) eiuted as a sharp peak at 300 mM NaCl. 

25 

2. Purification of insoluble proteins from inclusion bodies 

The following steps were carried out at 4°C. Cell pellets were resuspended in lysis 
buffer with 1 0% glycerol 200 ml lysozyme, 5 mM EDTA. 1 mM PMSF and 0.1 % - 
mercaptoethanol. After passage through the cell disrupter, the resulting homogenate was 

30 made 0.2 % deoxycholate, stirred 1 0 minutes, then centriftiged at 20,000 x g, for 30 min. 
The pellets were washed with lysis buffer containing 10 % glycerol, 10 mM EDTA, 1% 
Triton X-100, 1 mM PMSF and 0.1% -mercaptoethanol, followed by several washes with 
lysis buffer containing 1 M urea, 1 mM PMSF and 0.1 % 2-mercaptoethanol. The resulting 
white pellet was composed primarily of inclusion bodies, free of unbroken cells and 

3 5 membranous materials. . 

Recombinant proteins 30100332 (SEQ ID NO: 181), 4721061 (SEQ ID NO: 211) 

The following steps were carried out at room temperature. Purified inclusion bodies 
were dissolved in 20 ml 8.0 M urea in lysis buffer with 1 mM PMSF and 0. 1 % 2- 
40 mercaptoethanol, and incubated at room temperature for 1 hour. Materials that did not 
dissolve were removed by centrifugation. The clear supernatant was filtered, then loaded 
onto a Ni2+ -NTA agarose column pre-equilibrated in 8.0 M urea in Lysis Buffer. The 
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5 column was washed with 250 ml (50 bed volumes) of lysis buffer containing 8 M urea, 1.0 
mM PMSF and 0.1 % 2-mercaptoethanol, and developed with sequential steps of lysis 
buffer containing 8M urea, 1 mM PMSF ? 0.1 % 2-mercaptoethanol and 20, 100, 200, and 
500 mM imidazole in succession. Fractions were monitored by absorbance at OD28O nm, 
and peak fractions were analyzed by SDS-PAGE. Fractions containing the recombinant 
1 0 protein eluted at 1 00 mM imidazole. 



Recombinant proteins 29479681 (SEQ ID NO: 179) } 978477 (SEQ ID NO: 228) 

The pellet containing the inclusion bodies was solubilized in buffer B containing 8 
M urea, 1 mM PMSF and 0 1 % 2-mercaptoethano!, and incubated for 1 hour at room 
1 5 temperature. Insoluble materials were removed by centrifiigation at 20,000 x g for 30 min, 
and the cleared supernatant was loaded onto a 15 ml ( 1 .6 x 7.5 cm ) SP-Sepharose column 
pre-equilibrated in buffer B, 6 M urea, 1 mM PMSF, 0.1 % 2-mercaptoethanol. After 
washing the column with 10 bed volumes, the column was developed with a linear gradient 
from 0 to 500 mM NaCl. 

20 

Dialysis and concentration of protein samples 

Urea was removed slowly from the protein samples by dialysis against Tris- 
buffered saline (TBS; 10 mM Tris pH 8.0, 150 mM NaCl) containing 0.5 % deoxycholate 
(DOC) with sequential reduction in urea concentration as follows; 6M, 4M, 3M, 2M, 1M, 
25 0.5 M and finally TBS without any urea. Each dialysis step was conducted for a minimum 
of 4 hours at room temperature. 

After dialysis, samples were concentrated by pressure filtration using Amicon 
stirred-cells. Protein concentrations were measured using the methods of Perkins (1986 
Eur. J. Biochem. 157, 169-180), Bradford ((1976) Anal. Biochem. 72, 248-254) and Lowry 
30 ((1951) J. Biol. Chem. 193, pages 265-275). 

The recombinant proteins purified by the methods described above are summarized 
in Table 3 below. 



TABLE 3 



J99 
Sequence 
Identifier 


Homolog 
identified 
by Blast 


Gene 
symbol of 
Homolog 


Baccterial 
fraction 
used to 
purify 

recombinant 
proteins 


Method of 
purification 


Relative 
MW on 
SDS-PAGE 
gel 


Final Con- 
centration 
of purified 
protein 


Composition 
of buffer 


Outer Membrane Proteins 


71 16626 
(SEQ ID 
NO: 223) 


P26093 


e(P4) 


Soluble 
fraction 


His-Tag 


29 kDa 


0.8 mg/ml 


A 














1.85 mg/ml 


C 


29479681 
(SEQ ID 
NO: 179) 


P13036 


fecA 


Inclusions 
bodies 


SP- 
Sepharose 


23 kDa 


2.36 mg/ml 


B 
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0.5 mg ml 


B 
















as dry pellet 










gel filtration SI 00 HR 






Periplasm ic/Secreted Protein 


3010032 
(SEQ ID 
NO: 181) 


P23847 


dppA 


Inclusion 
bodies 


His-Tag 


It kDa 


2.88 mg/ml 


B 


4721061 
(SEQ ID 
NO: 211) 


P36175 


GCP 


Inclusion 
bodies 


His-Tag 


38 kDa 


2.8 mg/ml 


B 


Other Surface Proteins 


4821082 
(SEQ ID 
NO:2l2) 


P08089 


M protein 


Inclusion 
bodies 


His-Tag 


20 kDa 


1.16 mg/ml 


B 


978477 
(SEQ ID 
NO: 228) 


L28919 


FBP54 


Inclusion 
bodies 


SP- 
Sepharose 


44 kDa 


2.56 mg/ml 


B 














0.3 mg/ml 


B 


Control Proteins with His- Tag 




P00722 


lacZ 


Soluble 
fraction 


His-Tag 


116 kDa 


10 mg/ml 


A 










gel filtration S200 HR 










ppiB 


Soluble 
fraction 


His-Tag 


21 kDa 


4.4 mg/ml 


A 










gel filtration SI 00 HR 







Buffer compositions: 
A- 10 mM Hepes pH 7.5, 150 mM NaCl, 0.1 mM EGTA 
B= 10 mM Tris pH 8.0, 150 mM NaCl, 0.5 % DOC 
C= 10 mM MOPS pH 6.5, 300 mM NaCl, 0.1 EGTA 



10 IV. Analysis of H. pylori proteins as Vaccine candidates 

To investigate the immunomodulatory effect of H. pylori proteins, a mouse///. 
pylori model was used. This model mimics the human H. pylori infection in many 
respects. The focus is on the effect of oral immunization in H. pylori infected animals in 
order to test the concept of therapeutic oral immunotherapy. 



15 



20 



Animals 

Female SPF BALB/c mice were purchased from Bomholt Breeding center 
(Denmark). They were kept in ordinary makrolon cages with free supply of water and food. 
The animals were 4-6 weeks old at arrival. 



Infection 

After a minimum of one week of acclimatization, the animals were infected with a 
type 2 strain (VacA negative) of//, pylori (strain AH244, originally isolated from an ulcer 
patient). In our hands, this strain has earlier proven to be a good colonizer of the mouse 
25 stomach. The bacteria were grown overnight in Brucella broth supplemented with 1 0 % 
fetal calf serum, at 37°C in a microaerophilic atmosphere (10% C0 2> 5%0 2 ). The animals 
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5 were given an oral dose of omeprazole (400 |imol/kg) and 3-5 h after this an oral 

inoculation of//, pylori in broth (approximately 10 8 cfu/animal). Positive take of the 
infection was checked in some animals 2-3 weeks after the inoculation. 

Antigens 

1 o Recombinant H. pylori antigens were chosen based on their association with 

externally exposed //. pylori cell membrane. These antigens were selected from the 
following groups: (1.) Outer Membrane Proteins; (2.) Periplastic/Secreted proteins; (3.) 
Outer Surface proteins; and (4.) Inner Membrane proteins. All recombinant proteins were 
constructed with a hexa-HIS tag for purification reasons and the non-Hclicobactcr pylori 
15 control protein (P-galactosidase from E. coli; LacZ), was constructed in the same way. 

All antigens were given in a soluble form, i.e. dissolved in either a HEPES buffer or 
in a buffer containing 0.5% Deoxycholate (DOC). 
The antigens are listed in Table 4 below. 

20 Table 4 

Helicobacter pylori proteins 

Outer membrane Proteins 

SEQ ID NO: 179 
25 SEQIDNO:223 

Periplastic/Secreted proteins 

SEQ ID NO:181 
SEQ IDNO:211 

30 

Other cell envelope proteins 

SEQ ID NO:212 
SEQ ID NO:228 

35 Control proteins 

{J-galactosidase (LacZ) 

Immunizations 

Ten animals in each group were immunized 4 times over a 34 day period 
40 (day 1 , 1 5, 25 and 35). Purified antigens in solution or suspension were given at a dose of 
100 fig/mouse. As an adjuvant, the animals were also given 10 ^g/mouse of Cholera toxin 
(CT) with each immunization. Omeprazole (400 |amol/kg) was given orally to the animals 
3-5 h prior to immunization as a way of protecting the antigens from acid degradation. 
Infected control animals received HEPES buffer + CT or DOC buffer + CT. Animals were 
45 sacrificed 2-4 weeks after final immunization. A general outline of the study is shown in 
Table 5 below. 
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5 Table 5 

Study outline, therapeutic immunization: 



Mice were all infected with H. pylori strain AH244 at day 30. Proteins are listed by their Seq ID #'s. 



Substance 


Mouse strain 


Dose/mouse 


Dates for 




n=10 




dosinp 


1. Controls, PBS 


Balb/c 


0.3 ml 


0, 14,24, 34 


2. Cholera toxin, 1 0 ng 


Balb/c 


0.3 ml 


0, 14,24,34 


3. Protein 179, 100 ng + CT 10 ng 


Balb/c 


0.3 ml 


0, 14. 24, 34 


4. Protein 181, 100 ng + CT 10 jig 


Balb/c 


0.3 ml 


0, 14, 24, 34 


5. Protein 21 1, 100 ng + CT 10 jig 


Balb/c 


0.3 ml 


0, 14. 24, 34 


6. Protein 212, 100 ng + CT 10 Mg 


Balb/c 


0.3 ml 


0, 14, 24, 34 


7. Protein 228, 100 jig + CT 10 ng 


Balb/c 


0.3 ml 


0, 14, 24, 34 


8. Protein 223, 100 ng + CT 10 ng 


Baib/c 


0.3 ml 


0, 14. 24, 34 



20 

Analysis of infection 

Mucosal infection: The mice were sacrificed by C0 2 and cervical dislocation. The 
abdomen was opened and the stomach removed. After cutting the stomach along the 
greater curvature, it was rinsed in saline. The mucosa from the antrum and corpus of an 
25 area of 25mm was scraped separately with a surgical scalpel. The mucosa scraping was 
suspended in Brucella broth and plated onto Blood Skirrow selective plates. The plates 
were incubated under microaerophilic conditions for 3-5 days and the number of colonies 
was counted. The identity of H. pylori was ascertained by urease and catalase test and by 
direct microscopy or Gram staining. 
30 The urease test was performed essentially as follows. The reagent, Urea Agar Base 

Concentrate, was purchased from DIFCO Laboratories, Detroit, MI (Catalog # 0284-61-3). 
Urea agar base concentrate was diluted 1:10 with water. 1 ml of the diluted concentrate 
was mixed with 100-200 ^1 of actively growing H. pylori cells. Color change to magenta 
indicated that cells were urease positive. 
35 The catalase test was performed essentially as follows. The reagent, N,N,N',N'- 

Tetramethyl-p-Phenylenediamine, was purchased from Sigma, St. Louis, MO (Catalog # 
T3 1 34). A solution of the reagent (1% w/v in water) was prepared. H. pylori cells were 
swabbed onto Whatman filter paper and overlaid with the 1% solution. Color change to 
dark blue indicated that the cells were catalase positive. 
40 Serum antibodies: From all mice serum was prepared from blood drawn by heart 

puncture. Serum antibodies were identified by regular ELISA techniques, where the 
specific antigens of Helicobacter pylori were plated. 

Mucosal antibodies: Gentle scrapings of a defined part of the corpus and of 4 cm of 
duodenum were performed in 50% of the mice in order to detect the presence of antibodies 
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5 in the mucous. The antibody titers were determined by regular ELISA technique as for 
serum antibodies. 

Statistical analysis: Wilcoxon-Mann- Whitney sign rank test was used for 
determination of significant effects of the antigens on Helicobacter pylori colonization. 
PO.05 was considered significant. Because the antrum is the major colonization site for 
10 Helicobacter most emphasis was put upon changes in the antral colonization. 

Results 

Antibodies in sera : All antigens tested given together with CT gave rise to a 
measurable specific titer in serum. The highest responses were seen with SEQ ID 

15 NOs:223, 21 1, and 212 (see Figure 2). 

Antibodies in mucus : In the mucus scrapings, specific antibodies against all 
antigens tested were seen. By far the strongest response was seen with SEQ ID NOs:l 81 , 
followed by 223 (see Figure 3). 

Therapeutic immunization effects: 

20 All control animals (BALB/c mice) were well colonized with H pylori (strain 

AH244) in both antrum and corpus of the stomach. Of the antigens tested 2 proteins (SEQ 
ID NOs: 21 1 and 212) gave a good and significant reduction and/or eradication of the H 
pylori infection. The degree of colonization of the antrum was lower following 
immunization with SEQ ID NOs: 228 and 223 compared to control. The effect of SEQ ID 

25 NOs: 1 79, and 1 8 1 did not differ from control. The control protein lacZ, i.e. the non-//. 
pylori protein, had no eradication effect and in fact had higher Helicobacter colonization 
compared to the HEPES + CT control. All data are shown in Figures 4 and 5 for proteins 
dissolved in HEPES and DOC respectively. Data is shown as geometric mean values. n=8- 
10 Wilcoxon-Mann- Whitney sign rank test * = p<0.05; x/10 = number of mice showing 

30 eradication of H. pylori over the total number of mice examined. 

The data presented indicate that all of the H pylori associated proteins included in 
this study, when used as oral immunogens in conjunction with the oral adjuvant CT, 
resulted in stimulation of an immune response as measured by specific serum and mucosal 
antibodies. A majority of the proteins led to a reduction, and in some cases complete 

35 clearance of the colonization of H. pylori in this animal model. It should be noted that the 
reduction or clearance was due to heterologous protection rather than homologous 
protection (the polypeptides were based on the H pylori J99 strain sequence and used in the 
therapeutic immunization studies against a different (AH244) challenge strain), indicating 
the vaccine potential against a wide variety of//, pylori strains. 

40 The highest colonization in the antrum was seen in animals treated with the non- 

Helicobacter protein LacZ, indicating that the effects seen with the Helicobacter pylori 
antigens were specific. 
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5 Taken together these data strongly support the use of these H. pylori proteins in a 

pharmaceutical formulation for the use in humans to treat and/or prevent H. pylori 
infections. 

V, Sequence Variance Analysis of eenes in Helicobacter pylori strains 
1 0 Three genes were cloned and sequenced from several strains of H. pylori to 

compare the DNA and deduced amino acid sequences. This information was used to 
determine the sequence variation between the K pylori strain, J99, and other H. pylori 
strains isolated from human patients. 

1 5 Preparation of Chromosomal DNA. 

Cultures of H pylori strains (as listed in Table 8) were grown in BLBB (1% 
Tryptone, 1% Peptamin 0.1% Glucose, 0.2% Yeast Extract 0.5% Sodium Chloride, 5% 
Fetal Bovine Serum) to an OD^o of 0.2. Cells were centrifuged in a Sorvall RC-3B at 
3500 x g at 4°C for 15 minutes and the pellet resuspended in 0.95 mis of 10 mM Tris-HCl, 

20 0. 1 mM EDTA (TE). Lysozyme was added to a final concentration of lmg/ml along with 
SDS to 1% and RNAse A + Tl to 0.5mg/ml and 5 units/ml respectively, and incubated at 
37°C for one hour. Proteinase K was then added to a final concentration of 0.4mg/ml and 
the sample was incubated at 55°C for more than one hour. NaCl was added to the sample 
to a concentration of 0.65 M, mixed carefully, and 0.1 5 ml of 1 0% CTAB in 0.7M NaCL 

25 (final is 1 % CTAB/70mM NaCL) was added followed by incubation at 65°C for 20 
minutes. At this point, the samples were extracted with chloroformrisoamyl alcohol, 
extracted with phenol, and extracted again with chloroformrisoamyl alcohol. DNA was 
precipitated with either EtOH (1 .5 x volumes) or isopropanol (0.6 x volumes) at -70°C for 
1 0 minutes, washed in 70% EtOH and resuspended in TE. 

30 

PCR Amplification and cloning. 

Genomic DNA prepared from twelve strains of Helicobacter pylori was used as the 
source of template DNA for PCR amplification reactions (Current Protocols in Molecular 
Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1 994). To amplify a DNA 

35 sequence containing an H pylori ORF, genomic DNA (10 nanograms) was introduced into 
a reaction vial containing 2 mM MgCl 2 , 1 micromolar synthetic oligonucleotide primers 
(forward and reverse primers, see Table 6) complementary to and flanking a defined H 
pylori ORF, 0.2 mM of each deoxynucleotide triphosphate; dATP, dGTP, dCTP, dTTP and 
0.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., 

40 Branchburg, NJ, USA) in a final volume of 20 microliters in duplicate reactions. 
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5 Table 6 

Oligonucleotide primers used for PCR amplification of//, pylori DNA sequences. 



Outer membrane 
Proteins 



Forward primer 5' to 3' 



Reverse Primer 5' to 3* 



SEQ lDNO:223 



5'-ATATCCATGGTGAGTTTGA 
TGA-3' (SEQIDNO:243) 



S'-ATGAATTCAA I 1 i'l 1TATTTT 
GCCA-3' (SEQ1DNQ:244) 



SEQ ID NO: 179 



5'-AATTCCATGGCTATCCAAAT 
CCG-3'(SEQIDNO:245) 



5'-ATGAATTCGCCAAAATCGTA 
GTATT-3' (SEQ ID NO:246) 



SEQ ID NO: 199 



5-GATACCATGGAATTTATGAA 
AAAG-3' (SEQ IDNO:247) 



5 f -TGAATTCGAAAAAGTGTAGT 
TATAC-3' (SEQ ID NO:248) 



The following thermal cycling conditions were used to obtain amplified DNA 
products for each ORF using a Perkin Elmer Cetus/ GeneAmp PCR System 9600 thermal 
1 0 cycler: 



Sequences (by SEQ ID NO:) 223 and 1 99; 
Denaturation at 94°C for 2 min, 

2 cycles at 94 l, C for 15 sec, 30°C for 1 5 sec and 72°C for 1 .5 min 
1 5 23 cycles at 94°C for 1 5 sec, 55°C for 1 5 sec and 72°C for 1 .5 min 
Reactions were concluded at 72°C for 6 minutes. 



Sequences (by SEQ ID NO:) 179; 
Denaturation at 94°C for 2 min, 
20 2 cycles at 94°C for 1 5 sec, 30°C for 20 sec and 72°C for 2 min 
25 cycles at 94°C for 1 5 sec, 55°C for 20 sec and 72°C for 2 min 
Reactions were concluded at 72°C for 8 minutes. 



Upon completion of thermal cycling reactions, each pair of samples were combined 
25 and used directly for cloning into the pCR cloning vector as described below. 

Cloning ofK pylori DNA sequences into the pCR TA cloning vector. 

All amplified inserts were cloned into the pCR 2.1 (pCRII in the case of H. pylori 
sequence 223) vector by the method described in the Original TA cloning kit (Invitrogen, 
30 San Diego, CA). Products of the ligation reaction were then used to transform the 

TOPI OF' (INVaF in the case of H. pylori sequence 223) strain of £. coli as described 
below. 

Transformation of competent bacteria with recombinant plasmids 
35 Competent bacteria, E coli strain TOPI OF or E. coli strain INVaF were 

transformed with recombinant pCR expression plasmids carrying the cloned H. pylori 
sequences according to standard methods (Current Protocols in Molecular Biology, John 
Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). Briefly, 2 microliters of 0.5 
micromolar BME was added to each vial of 50 microliters of competent cells. 



WO 97/19098 



PCT/US96/18542 



-61- 

5 Subsequently, 2 microliters of ligation reaction was mixed with the competent cells and 
incubated on ice for 30 minutes. The cells and ligation mixture were then subjected to a 
"heat shock 1 * at 42°C for 30 seconds, and were subsequently placed on ice for an additional 
2 minutes, after which, samples were incubated in 0.45 milliliters SOC medium (0.5% 
yeast extract, 2.0 % tryptone, 10 mM NaCl, 2.5 mM KC1, 10 mM MgC12 ; 10 mM MgS04 
10 and 20, mM glucose) at 37°C with shaking for 1 hour. Samples were then spread on LB 
agar plates containing 25 microgram/ml kanamycin sulfate or 100 micrograms/ml 
ampicillan for growth overnight. Transformed colonies of TOPI OF' or INVaF' were then 
picked and analyzed to evaluate cloned inserts as described below. 

1 5 Identification of recombinant PCR plasmids carrying H. pylori sequences 

Individual TOPI OF' or INVaF' clones transformed with recombinant pCR-H.pylori 
ORFs were analyzed by PCR amplification of the cloned inserts using the same forward 
and reverse primers, specific for each H. pylori sequence, that were used in the original 
PCR amplification cloning reactions. Successful amplification verified the integration of 

20 the H. pylori sequences in the cloning vector (Current Protocols in Molecular Biology, John 
Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). 

Individual clones of recombinant pCR vectors carrying properly cloned //. pylori 
ORFs were picked for sequence analysis. Sequence analysis was performed on ABI 
Sequencers using standard protocols (Perkin Elmer) using vector-specific primers (as found 

25 in PCRII or pCR2.1 , Invitrogen, San Diego, CA) and sequencing primers specific to the 
ORF as listed in Table 7 below. 

Table 7 



Oligonucleotide primers used for sequencing of//, pylori DNA sequences. 

30 



Outer 

membrane 

Proteins 


Forward primers 5' to 3* 


Reverse Primers 5 1 to 3' 








SEQ ID NO:223 


5'- nUAACACTri'lGA 1'IA 1 GCGG-3' 
(SEQ ID NO:249) 

S'-GGATTATGCGATTGTTTTACAAG- 
3' (SEQ ID NO:250) 


S'-GTCTTTAGCAAAAATGGCGTC- 
3' (SEQIDNO:251) 
5-AATGAGCGTAAGAGAGCCTTC- 
3' (SEQ IDNO:252) 


SEQIDNO:179 


5'-CTTATGGGGGTATTGTCA-3* (SEQ ID 
NO:253) 

5-AGCATGTGGGTATCCAGC-3' (SEQ 
ID NO:254) 


5'-AGGTTGTTGCCTAAAGACT-3' 
(SEQ IDNO:255) 
S'-CTGCCTCCACCTTTGATCo' 
(SEQ1DN0:256) 


SEQ ID NO: 199 


5VACCAATATCAATTGGCACT-3' (SEQ 
IDNO:257) 

5'-ACTTGGAAAAGCTCTGCA-3' (SEQ 
IDNO:258) 


S'-CTTGCTTGTCATATCTAGC-S' 
(SEQ ID NO:259) 
5^GTTGAAGTGTTGGTGCTA-3' 
(SEQ IDNO:260) 
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5'-CAAGCAAGTGGTTTGGTT7TAG-3' 
(SEQIDNO:261) 

5-TGGAAAGAGCAAATCATTGAAG-3' 
(SEQ1DN0:262) 


5-GCCCATAATCAAAAAGCCCAT- 
3' (SEQ IDNO:263) 
5'-CTAAAACCAAACCACTTGCT 
TGTC-3' (SEQ ID NO:264) 








Vector Primers 


5'-GTAAAACGACGGCCAG-3' (SEQ ID 
NO:265) 


5*-CAGGAAACAGCTATGAC-3' 
(SEQ IDNO:266) 



5 

Results 



The PCR error rate for the experiments described herein was determined to be 2 
base changes out of 4485 bases, which is equivalent to an estimated error rate of less than 
or equal to 0.04%. 

1 0 DN A sequence analysis was performed on four different open reading frames 

identified as genes and amplified by PCR methods from a dozen different strains of the 
bacterium Helicobacter pylori. The deduced amino acid sequences of three of the four 
open reading frames that were selected for this study showed statistically significant 
BLAST homology to defined proteins present in other bacterial species. Those ORFs 

1 5 included: SEQ ID NO:223, homologous to lipoprotein e (P4) present in the outer 

membrane of H. influenzae; SEQ ID NO: 1 79, homologous to fecA, an outer membrane 
receptor in iron (III) dicitrate transport in E. coli. SEQ ID NO: 1 99 was identified as an 
unknown open reading frame, because it showed low homology with sequences in the 
public databases. 

20 To assess the extent of conservation or variance in the ORFs across various strains 

of//, pylori, changes in DNA sequence and the deduced protein sequence were compared 
to the DNA and deduced protein sequences found in the J99 strain of//, pylori (see Table 8 
below). Results are presented as percent identity to the J99 strain of K pylori sequenced by 
random shotgun cloning. To control for any variations in the J99 sequence each of the four 

25 open reading frames were cloned and sequenced again from the J99 bacterial strain and that 
sequence information was compared to the sequence information that had been collected 
from inserts cloned by random shotgun sequencing of the J99 strain. 
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5 Table 8 

Multiple Strain DNA Sequence analysis of H. pylori Vaccine Candidates 

J99 Sea. ID #: 
Length of Region 
Sequenced: 

Strain Tested 



J99 

AH244 

AH4 

AH5 

AH15 

AH61 

5155 

5294 

7958 

5640 

AH18 

AH24 

n.d. = not done 



223 


223 


179 


179 


199 


199 


232 aa. 


696 nt 


182 a.a. 


548 nt. 


273 a.a. 


819 nt. 


AA 


Nuc. 


AA 


Nuc. 


AA identity 


Nuc. 


identity 


identity 


identity 


identity 




Identity 


100.00% 


100.00% 


100.00% 


100.00% 


99.63% 


99.88% 


n.d. 


n.d. 


99.09% 


96.71% 


98.90% 


96.45% 


97.84% 


95.83% 


n.d. 


n.d. 


97.80% 


9573% 


98.28% 


96.12% 


98.91% 


96.90% 


98.53% 


95.73% 


97.41% 


95.98% 


99.82% 


97.99% 


99.63% 


96.09% 


97.84% 


95.98% 


99.27% 


97.44% 


n.d. 


n.d. 


n.d. 


n.d. 


99.45% 


97.08% 


98.53% 


95.60% 


98.28% 


95.40% 


99.64% 


97.26% 


97.07% 


95.48% 


97.84% 


95.40% 


n.d. 


n.d. 


99.63% 


96.46% 


97.41% 


95.69% 


99.09% 


97.63% 


98.53% 


95.48% 


98.71% 


95.69% 


99.64% 


97.44% 


100.00% 


95.97% 


97.84% 


95.40% 


99.27% 


96.71% 


100.00% 


96.46% 



VI Experimental Knock-Out Protocol for the Determinatio n of Essential H. pylori Genes 

as Potential Therapeutic Targets 
1 0 Therapeutic targets are chosen from genes whose protein products appear to play 

key roles in essential cell pathways such as cell envelope synthesis, DNA synthesis, 

transcription, translation, regulation and colonization/virulence. 

The protocol for the deletion of portions of H. pylori genes/ORFs and the 

insertional mutagenesis of a kanamycin-resistance cassette in order to identify genes which 
1 5 are essential to the cell is modified from previously published methods (Labigne-Roussel et 

al., 1988, J. Bacteriology 170, pp. 1704-1708; Cover et al.,1994, J. Biological Chemistry 

269, pp. 10566-10573; Reyrat et al., 1995, Proc. Natl. Acad. Sci. 92, pp 8768-8772). The 

result is a gene "knock-out." 



20 Identification and Cloning of H. pylori Gene Sequences 

The sequences of the genes or ORFs (open reading frames) selected as knock-out 
targets are identified from the H. pylori genomic sequence and used to design primers to 
specifically amplify the genes/ORFs. All synthetic oligonucleotide primers are designed 
with the aid of the OLIGO program (National Biosciences, Inc.. Plymouth, MN 55447, 

25 USA), and can be purchased from Gibco/BRL Life Technologies (Gaithersburg, MD, 
USA). If the ORF is smaller than 800 to 1000 base pairs, flanking primers are chosen 
outside of the open reading frame. 
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5 Genomic DNA prepared from the Helicobacter pylori HP-J99 strain (ATCC 55679 ) 

is used as the source of template DNA for amplification of the ORFs by PGR (polymerase 
chain reaction) (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. 
Ausubel et al., editors, 1994). For the preparation of genomic DNA from H. pylori, see 
Example I. PCR amplification is carried out by introducing 10 nanograms of genomic HP- 
10 J99 DNA into a reaction vial containing 1 0 mM Tris pH 8.3, 50 mM KC1, 2 mM MgCl 2 , 2 
microMolar synthetic oligonucleotide primers (forward=Fl and reverse=Rl), 0.2 mM of 
each deoxynucleotide triphosphate (dATP,dGTP, dCTP, dTTP), and 1.25 units of heat 
stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, NJ, USA) 
in a final volume of 40 microliters. The PCR is carried out with Perkin Elmer 
1 5 Cetus/GeneAmp PCR System 9600 thermal cyclers. 

Upon completion of thermal cycling reactions, each sample of amplified DNA is 
visualized on a 2% TAE agarose gel stained with Ethidium Bromide (Current Protocols in 
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994) to 
determine that a single product of the expected size had resulted from the reaction. 
20 Amplified DNA is then washed and purified using the Qiaquick Spin PCR purification kit 
(Qiagen, Gaithersburg, MD, USA). 

PCR products are cloned into the pT7Blue T- Vector (catalog#69820-l, Novagen, 
Inc., Madison, WI, USA) using the TA cloning strategy (Current Protocols in Molecular 
Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). The ligation of the 
25 PCR product into the vector is accomplished by mixing a 6 fold molar excess of the PCR 
product, 10 ng of pT7Blue-T vector (Novagen), 1 microliter of T4 DNA Ligase Buffer 
(New England Biolabs, Beverly, MA, USA), and 200 units of T4 DNA Ligase (New 
England Biolabs) into a final reaction volume of 10 microliters. Ligation is allowed to 
proceed for 16 hours at 16°C. 
30 Ligation products are electroporated (Current Protocols in Molecular Biology, John 

Wiley and Sons, Inc., F. Ausubel et al., editors, 1994) into eiectroporation-competent XL-1 
Blue or DH5-a E.coli cells (Clontech Lab., Inc. Palo Alto, CA, USA). Briefly, 1 microliter 
of ligation reaction is mixed with 40 microliters of electrocompetent cells and subjected to 
a high voltage pulse (25 microFarads, 2.5 kV, 200 ohms) after which the samples are 
35 incubated in 0.45 ml SOC medium (0.5% yeast extract, 2% tryptone, 10 mM NaCl, 2.5 mM 
KC1, 10 mM MgCl 2 , 10 mM MgS0 4 and 20 mM glucose) at 37°C with shaking for 1 hour. 
Samples are then spread onto LB (10 g/1 bacto tryptone, 5 g/1 bacto yeast extract, 10 g/1 
sodium chloride) plates containing 100 microgram/ml of Ampicillin, 0.3% X-gal, and 100 
microgram/ml IPTG. These plates are incubated overnight at 37°C. Ampicillin-resistant 
40 colonies with white color are selected, grown in 5 ml of liquid LB containing 100 

microgram/ml of Ampicillin, and plasmid DNA is isolated using the Qiagen miniprep 
protocol (Qiagen, Gaithersburg, MD, USA). 
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5 To verify that the correct Kpylori DNA inserts had been cloned, these pT7Blue 

plasmid DNAs are used as templates for PCR amplification of the cloned inserts, using the 
same forward and reverse primers used for the initial amplification of the J99 Kpylori 
sequence. Recognition of the primers and a PCR product of the correct size as visualized 
on a 2% TAE, ethidium bromide stained agarose gel are confirmation that the correct 

1 0 inserts had been cloned. Two to six such verified clones are obtained for each knock-out 
target, and frozen at -70°C for storage. To minimize errors due to PCR, plasmid DNA from 
these verified clones are pooled, and used in subsequent cloning steps. 

The sequences of the genes/ORFs are again used to design a second pair of primers 
which flank the region of H. pylon DNA to be either interrupted or deleted (up to 250 

1 5 basepairs) within the ORFs but are oriented away from each other. The pool of circular 
plasmid DNAs of the previously isolated clones are used as templates for this round of 
PCR. Since the orientation of amplification of this pair of deletion primers is away from 
each other, the portion of the ORF between the primers is not included in the resultant PCR 
product. The PCR product is a linear piece of DNA with H. pylori DNA at each end and 

20 the pT7BIue vector backbone between them which, in essence, results in the deletion of a 
portion of the ORFs. The PCR product is visualized on a 1% TAE, ethidium bromide 
stained agarose gel to confirm that only a single product of the correct size has been 
amplified. 

A Kanamycin-resistance cassette (Labigne-Roussel et ai., 1988 J. Bacteriology 170, 

25 1 704-1 708) is ligated to this PCR product by the TA cloning method used previously 
(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., 
editors, 1 994). The Kanamycin cassette containing a Campylobacter kanamycin resistance 
gene is obtained by carrying out an EcoRI digestion of the recombinant plasmid 
pCTB8:&™ (Cover et al.,1994, J. Biological Chemistry 269, pp. 10566-10573). The proper 

30 fragment (1 .4 kb) is isolated on a 1% TAE gel, and isolated using the QIAquick gel 

extraction kit (Qiagen, Gaithersburg, MD, USA). The fragment is end repaired using the 
Klenow fill-in protocol, which invol ved mixing 4ug of the DNA fragment, 1 microliter of 
dATP,dGTP, dCTP, dTTP at 0.5 mM, 2 microliter of Klenow Buffer (New England 
Biolabs) and 5 units of Klenow DNA Polymerase I Large (Klenow) Fragment (New 

35 England Biolabs) into a 20 microliter reaction, incubating at 30°C for 15 min, and 

inactivating the enzyme by heating to 75°C for 10 minutes. This blunt-ended Kanamycin 
cassette is then purified through a Qiaquick column (Qiagen, Gaithersburg, MD, USA) to 
eliminate nucleotides. The "T" overhang is then generated by mixing 5 micrograms of the 
blunt-ended kanamycin cassette, 10 mM Tris pH 8.3, 50 mM KCI, 2 mM MgCl 2 , 5 units of 

40 DNA Polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, NJ, USA), 20 
microliters of 5 mM dTTP, in a 100 microliter reaction and incubating the reaction for 2 
hours at 37°C. The "Kan-T M cassette is purified using a QIAquick column (Qiagen, 
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5 Gaithersburg, MD, USA). The PCR product of the deletion primers (F2 and R2) is ligated 
to the Kan-T cassette by mixing 10 to 25 ng of deletion primer PCR product, 50 - 75 ng 
Kan-T cassette DNA, 1 microliter lOx T4 DNA Ligase reaction mixture, 0.5 microliter T4 
DNA Ligase (New England Biolabs, Beverly, MA, USA) in a 10 microliter reaction and 
incubating for 16 hours at 16 U C. 

10 The ligation products are transformed into XL-1 Blue or DH5-a E.coli cells by 

electroporation as described previously. After recovery in SOC, cells are plated onto LB 
plates containing 100 microgram/ml Ampicillin and grown overnight at 37°C. These plates 
are then replica plated onto plates containing 25 microgram/ml Kanamycin and allowed to 
grow overnight. Resultant colonies have both the Ampicillin resistance gene present in the 

15 pT7Blue vector, and the newly introduced Kanamycin resistance gene. Colonies are picked 
into LB containing 25 microgram/ml Kanamycin and plasmid DNA is isolated from the 
cultured cells using the Qiagen miniprep protocol (Qiagen, Gaithersburg, MD, USA). 

Several tests by PCR amplification are conducted on these plasmids to verify that 
the Kanamycin is inserted in the K pylori gene/ORF, and to determine the orientation of 

20 the insertion of the Kanamycin-resistance gene relative to the H. pylori gene/ORF. To 
verify that the Kanamycin cassette is inserted into the H. pylori sequence, the plasmid 
DNAs are used as templates for PCR amplification with the set of primers originally used 
to clone the H. pylori gene/ORFs. The correct PCR product is the size of the deleted 
gene/ORF but increased in size by the addition of a 1 .4 kilobase Kanamycin cassette. To 

25 avoid potential polar effects of the kanamycin resistance cassette on H. pylori gene 

expression, the orientation of the Kanamycin resistance gene with respect to the knock-out 
gene/ORF is determined and both orientations are eventually used in H. pylori 
transformations (see below). To determine the orientation of insertion of the kanamycin 
resistance gene, primers are designed from the ends of the kanamycin resistance gene 

30 ("Kan- 1 11 5-ATCTTACCTATCACCTCAAAT-3' (SEQ ID NO:267), and "Kan-2" 5'- 
AGACAGCAACATCTTTGTGAA-3 1 (SEQ ID NO:268)). By using each of the cloning 
primers in conjunction with each of the Kan primers (4 combinations of primers), the 
orientation of the Kanamycin cassette relative to the H.pylori sequence is determined. 
Positive clones are classified as either in the "A" orientation (the same direction of 

35 transcription is present for both the H pylori gene and the Kanamycin resistance gene), or 
in the "B" orientation (the direction of transcription for the H.pylori gene is opposite to that 
of the Kanamycin resistance gene). Clones which share the same orientation (A or B) are 
pooled for subsequent experiments and independently transformed into H pylori. 



40 



Transformation of Plasmid DNA into H. pylori cells 

Two strains of H. pylori are used for transformation: HP-J99 (ATCC 55679) , the 
clinical isolate which provided the DNA from which the H. pylori sequence database is 
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5 obtained, and AH244, an isolate which had been passaged in, and has the ability to colonize 
the mouse stomach. Cells for transformation are grown at 37°C, 10% C0 2 , 100% humidity, 
either on Sheep-Blood agar plates or in Brucella Broth liquid. Cells are grown to 
exponential phase, and examined microscopically to determine that the cells are "healthy" 
(actively moving cells) and not contaminated. If grown on plates, cells are harvested by 

10 scraping cells from the plate with a sterile loop, suspended in 1 ml of Brucella Broth, spun 
down (1 minute, top speed in eppendorf microfuge) and resuspended in 200 microliters 
Brucella Broth. If grown in Brucella Broth liquid, cells are centrifuged (15 minutes at 3000 
rpm in a Beckman TJ6 centrifuge) and the cell pellet resuspended in 200 microliters of 
Brucella broth. An aliquot of cells is taken to determine the optical density at 600 nm, in 

1 5 order to calculate the concentration of cells. An aliquot (1 to 5 OD 60 o units/25 microliter) 
of the resuspended cells is placed onto a prewarmed Sheep-Blood agar plate, and the plate 
is further incubated at 37°C, 6% C0 2 , 100% humidity for 4 hours. After this incubation, 
10 microliters of plasmid DNA (100 micrograms per microliter) is spotted onto these cells. 
A positive control (plasmid DNA with the ribonuclease H gene disrupted by kanamycin 

20 resistance gene) and a negative control (no plasmid DNA) are done in parallel. The plates 
are returned to 37°C, 6% C0 2 for an additional 4 hours of incubation. Cells are then 
spread onto that plate using a swab wetted in Brucella broth, and grown for 20 hours at 
37°C, 6% C0 2 . Cells are then transferred to a Sheep-Blood agar plate containing 25 
micrograms/ml Kanamycin, and allowed to grow for 3 to 5 days at 37°C, 6% C0 2 , 100% 

25 humidity. If colonies appear, they are picked and regrown as patches on a fresh Sheep- 
Blood agar plate containing 25 micrograms/ml Kanamycin. 

Three sets of PCR tests are done to verify that the colonies of transformants have 
arisen from homologous recombination at the proper chromosomal location. The template 
for PCR (DNA from the colony) is obtained by a rapid boiling DNA preparation method as 

30 follows. An aliquot of the colony (stab of the colony with a toothpick) is introduced into 
100 microliters of 1% Triton X-100, 20 mM Tris, pH 8.5, and boiled for 6 minutes. An 
equal volume of phenol : chloroform (1:1) is added and vortexed. The mixture is 
microfuged for 5 minutes and the supernatant is used as DNA template for PCR with 
combinations of the following primers to verify homologous recombination at the proper 

35 chromosomal location. 

TEST 1 . PCR with cloning primers originally used to amplify the gene/ORF. A 
positive result of homologous recombination at the correct chromosomal location should 
show a single PCR product whose size is expected to be the size of the deleted gene/ORF 
but increased in size by the addition of a 1.4 kilobase Kanamycin cassette. A PCR product 

40 of just the size of the gene/ORF is proof that the gene had not been knocked out and that 
the transformant is not the result of homologous recombination at the correct chromosome 
location. 
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5 TEST 2. PCR with F3 (primer designed from sequences upstream of the gene/ORF 

and not present on the plasmid), and either primer Kan-1 or Kan-2 (primers designed from 
the ends of the kanamycin resistance gene), depending on whether the plasmid DNA used 
was of "A" or M B M orientation. Homologous recombination at the correct chromosomal 
location will result in a single PCR product of the expected size (i.e., from the location of 

10 F3 to the insertion site of kanamycin resistance gene). No PCR product or PCR product(s) 
of incorrect size(s) will prove that the plasmid had not integrated at the correct site and that 
the gene had not been knocked out. 

TEST 3. PCR with R3 (primer designed from sequences downstream of the 
gene/ORF and not present on the plasmid) and either primer Kan-1 or Kan-2. depending on 

1 5 whether the plasmid DNA used was of "A" or ,, B n orientation. Homologous recombination 
at the correct chromosomal location will result in a single PCR product of the expected size 
(i.e., from the insertion site of kanamycin resistance gene to the downstream location of 
R3). Again, no PCR product or PCR product(s) of incorrect size(s) will prove that the 
plasmid had not integrated at the correct site and that the gene had not been knocked out. 

20 Transformants showing positive results for all three tests above indicate that the 

gene is not essential for survival in vitro. 

A negative result in any of the three above tests for each transforrnant indicates that 
the gene had not been disrupted, and that the gene is essential for survival in vitro. 

In the event that no colonies result from two independent transformations while the 

25 positive control with the disrupted ribonuclease H plasmid DNA produces transformants, 
the plasmid DNA is further analyzed by PCR on DNA from transforrnant populations prior 
to plating for colony formation. This will verify that the plasmid can enter the cells and 
undergo homologous recombination at the correct site. Briefly, plasmid DNA is incubated 
according to the transformation protocol described above. DNA is extracted from the H. 

30 pylori cells immediately after incubation with the plasmid DNAs and the DNA is used as 
template for the above TEST 2 and TEST 3. Positive results in TEST 2 and TEST 3 would 
verify that the plasmid DNA could enter the cells and undergo homologous recombination 
at the correct chromosomal location. If TEST 2 and TEST 3 are positive, then failure to 
obtain viable transformants indicates that the gene is essential, and cells suffering a 

35 disruption in that gene are incapable of colony formation 

VII. High-throughput drug screen assay 
Cloning, expression and protein purification 

Cloning, transformation, expression and purification of the H. pylori target gene and 
40 its protein product,e.g., an H. pylori enzyme, to be used in a high-throughput drug screen 
assay, is carried out essentially as described in Examples II and III above. Development 
and application of a screening assay for a particular H. pylori gene product, peptidyl-propyl 
cis-trans isomerase, is described below as a specific example. 
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5 

Enzymatic Assay 

The assay is essentially as described by Fisher (Fischer, G„ et.al. (1984) Biomed. 
Biochim. Acta 43:1 101-1 1 11). The assay measures the cis-trans isomerization of the Ala- 
Pro bond in the test peptide N-sucxinyl-Ala-Ala-Pro-Phe-p-nitroanilide (Sigma # S-7388, 

10 lot # 84H5805). The assay is coupled with a-chymotrypsin, where the ability of the 
protease to cleave the test peptide occurs only when the Ala-Pro bond is in trans. The 
conversion of the test peptide to the trans isomer in the assay is followed at 390 nmona 
Beckman Model DU-650 spectophotometer. The data are collected every second with an 
average scanning of time of 0.5 second. Assays are carried out in 35 mM Hepes, pH 8.0, in 

1 5 a final volume of 400 ul, with 1 0 \xbA a-chymotrypsin (type 1-5 from bovine Pancreas, 
Sigma # C-7762, lot 23H7020) and 10 nM PPIase. To initiate the reaction, 1 0 ul of the 
substrate ( 2 mM N-Succinyl-Ala-Ala-Pro-Phe-p-nitroanilide in DMSO) is added to 390 \i\ 
of reaction mixture at room temperature. 

20 Enzymatic assay in crude bacterial extract. 

A 50 ml culture of Helicobacter pylori (strain J99) in Brucella broth is harvested at 
mid-log phase (OD6oo nm - 1) and resuspended in lysis buffer with the following protease 
inhibitors: 1 mM PMSF, and 10 ^g/ml of each of aprotinin, leupeptin, pepstatine, TLCK, 
TPCK, and soybean trypsin inhibitor. The suspension is subjected to 3 cycles of freeze- 

25 thaw ( 1 5 minutes at -70 ° C, then 30 minutes at room temperature), followed by sonication 
(three 20 second bursts). The lysate is centrifuged (1 2,000 g x 30 minutes) and the 
supernatant is assayed for enzymatic activity as described above. 

Many K pylori enzymes can be expressed at high levels and in an active form in £. 
colL Such high yields of purified proteins provide for the design of various high 

30 throughput drug screening assays. 

Other Embodiments 

Included in the invention are: allelic variations; natural mutants; induced mutants; 
proteins encoded by DNA that hybridizes under high or low stringency conditions to a 

35 nucleic acid which encodes a polypeptide as shown in SEQ ID NOs: 1-114 (for definitions 
of high and low stringency see Current Protocols in Molecular Biology, John Wiley & 
Sons, New York, 1989, 6.3.1 - 6.3.6, hereby incorporated by reference); and, polypeptides 
specifically bound by antisera to H. pylori polypeptides, especially by antisera to an active 
site or binding domain of H. pylori polypeptide 

40 The invention also includes fragments, preferably biologically active fragments, or 

analogs of//, pylori polypeptides A biologically active fragment or analog is one having 
any in vivo or in vitro activity which is characteristic of the H. pylori polypeptides shown in 
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5 SEQ ID NOs: 1 1 5-228, or of other naturally occurring H. pylori polypeptides, e.g., one or 
more of the biological activities described above. Especially preferred are fragments which 
exist in vivo, e.g., fragments which arise from post transcriptional processing or which arise 
from translation of alternatively spliced RNA's. Fragments include those expressed in 
native or endogenous cells as well as those made in expression systems, e.g., in CHO cells. 

10 Because peptides such as H. pylori polypeptides often exhibit a range of physiological 
properties and because such properties may be attributable to different portions of the 
molecule, a useful H. pylori fragment or H. pylori analog is one which exhibits a biological 
activity in any biological assay for K pylori activity. Most preferably the fragment or 
analog possesses 10%, preferably 40% r or at least 90% of the activity of H. pylori, in any in 

1 5 vivo or in vitro assay. 

Analogs can differ from naturally occurring H. pylori polypeptides in amino acid 
sequence or in ways that do not involve sequence, or both. Non-sequence modifications 
include changes in acetylation, methylation, phosphorylation, carboxylation, or 
glycosylation. 

20 Preferred analogs include H. pylori polypeptides (or biologically active fragments 

thereof) whose sequences differ from the wild-type sequence by one or more conservative 
amino acid substitutions or by one or more non-conservative amino acid substitutions, 
deletions, or insertions which do not abolish the biological activity of the H. pylori 
polypeptide. Conservative substitutions typically include the substitution of one amino 

25 acid for another with similar characteristics, e.g., substitutions within the following groups: 
valine, glycine; glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; 
asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. 
Other conservative substitutions can be taken from the table below. 

30 TABLE 9 



CONSERVATIVE AMINO ACID REPLACEMENTS 



jpFor Amino Acid 


Code 


Replace with any of 


Alanine 


A 


D-Ala, Gly, beta-Ala, L-Cys, D-Cys 


Arginine 


R 


D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, He, D- 
Met, D-Ile,Orn, D-Orn 


Asparagine 


N 


D-Asn, Asp, D-Asp, Glu. D-Glu, Gin, D-GIn 


Aspartic Acid 


D 


D-Asp, D-Asn, Asn ? Glu, D-Glu, Gin, D-GIn | 


Cysteine 


C 


D-Cys. S-Me-Cys, Met, D-Met, Thr, D-Thr 


Glutamine 


0 


D-GIn, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp 


Glutamic Acid 


E 


D-Glu, D-Asp, Asp, Asn, D-Asn, Gin, D-GIn 


Glycine 


G 


Ala, D-Ala, Pro, D-Pro, (J-Ala Acp 


Isoleucine 


1 


D-lle, Val, D-Val, Leu, D-Leu, Met, D-Met 


Leucine 


L 


D-Leu, Val, D-Vai, Leu, D-Leu, Met, D-Met 


Lysine 


K 


D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, Met, D-Met, 
He, D-Ile, Orn, D-Orn 


Methionine 


M 


D-Met, S-Me-Cys, He, D-Ile, Leu, D-Leu, Val, D-Val 
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Phenylalanine 


F 


D-Phe, Tyr, D-Thr, L-Dopa, His. D-His, Trp, D-Trp, 
Trans-3,4, or 5-phenylproline, cis-3,4, or 5-phenylproline 


Proline 


P 


D-Pro, L-I-thioazoIidine-4-carboxylic acid. D-or L-l- 
oxazolidine-4-carboxylic acid 


Serine 


S 


D-Ser, Thr, D-Thr, alio-Thr, Met,D-Met, Met(O), D- 
Met(O), L-Cys, D-Cys 


Threonine 


T 


D-Thr, Ser, D-Ser, allo-Thr, Met,D-Met, Met(O), D- 
Met(O), Val, D-Vai 


Tyrosine 


Y 


D-Tyr, Phe, D-Phe, L-Dopa, His, D-His 


Valine 


V 


D-Val ; Leu, D-Leu, He, D-Ile, Met,D-Met 



5 



Other analogs within the invention are those with modifications which increase 
peptide stability; such analogs may contain, for example, one or more non-peptide bonds 
(which replace the peptide bonds) in the peptide sequence. Also included are; analogs that 
include residues other than naturally occurring L-amino acids, e.g., D-amino acids or non- 
10 naturally occurring or synthetic amino acids, e.g., P or y amino acids; and cyclic analogs. 

As used herein, the term "fragment", as applied to a H. pylori analog, will ordinarily 
be at least about 20 residues, more typically at least about 40 residues, preferably at least 
about 60 residues in length. Fragments of//, pylori polypeptides can be generated by 
methods known to those skilled in the art. The ability of a candidate fragment to exhibit a 
1 5 biological activity of K pylori polypeptide can be assessed by methods known to those 
skilled in the art as described herein. Also included are H. pylori polypeptides containing 
residues that are not required for biological activity of the peptide or that result from 
alternative mRNA splicing or alternative protein processing events. 

In order to obtain an H. pylori polypeptide, H. pylori polypeptide-encoding DNA 
20 can be introduced into an expression vector, the vector introduced into a cell suitable for 
expression of the desired protein, and the peptide recovered and purified, by prior art 
methods. Antibodies to the peptides an proteins can be made by immunizing an animal, 
e.g., a rabbit or mouse, and recovering anti-//. pylori polypeptide antibodies by prior art 
methods. 

25 The nucleic acids and corresponding polypeptides of the invention were disclosed 

previously in the corresponding US application, U.S.S.N. 08/561,469, filed November 17, 
1995 (Attorney Docket No.: GTN-001CP). The correlation between sequence 
identification numbers in the above-identified parent applications and sequence 
identification numbers provided herein is outlined in Table 10 below. 
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5 TABLE 10 



Parent 


Relation 


New 


Parent 


Parent 


Relation- 


New 


Nucleic 


-ship 


Nucleic 


Amino 


amino acid 


ship 


Amino 


Acid SEQ 




Acid SEQ 


Acid SEQ 


HPP# 




Acid SEQ 


ID NO: 




ID NO: 


ID NO: 






ID NO: 


881 


a 


1 


385 


2 


a 


115 


882 


a 


2 


390 


9 


a 


116 


883 


a 


3 


401 


20 


a 


117 


884 


a 


4 


407 


26 


a 


118 


885 


a 


5 


409 


28 


a 


119 


886 


a 


6 


410 


29 


a 


120 


887 


a 


7 


413 


34 


a 


121 


888 


a 


8 


431 


55 


a 


122 


889 


a 


9 


435 


60 


a 


123 


890 


a 


10 


442 


68 


a 


124 


891 


a 


11 


445 


71 


a 


125 


892 


a 


12 


449 


75 


a 


126 


893 


a 


13 


463 


89 


a 


127 


894 


a 


14 


464 


90 


a 


128 


895 


a 


15 


467 


94 


a 


129 


896 




16 


470 


97 


a 


130 


897 


a 


17 


474 


101 


a 


131 


898 


a 


18 


476 


103 


a 


132 


899 


a 


19 


477 


104 


a 


133 


900 


a 


20 


480 


107 


a 
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EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than 
1 0 routine experimentation, many equivalents to the specific embodiments and methods 
described herein. Such equivalents are intended to be encompassed by the scope of the 
following claims- 



Other embodiments are within the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT 

(A) NAME: Astra Aktiebolag 

(B) STREET: S-151 85 

(C) CITY: Sodertalje 

(D) STATE: 

(E) COUNTRY: Sweden 

(F) POSTAL CODE (ZIP) : 



(ii) TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES 

RELATING TO HELICOBACTER PYLORI FOR 
DIAGNOSTICS AND THERAPEUTICS 



(iii) NUMBER OF SEQUENCES: 268 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: CD/ ROM ISO9660 
<B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: 



(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/561,469 

(B) FILING DATE: 17-NOV-1995 



(vii) CORRESPONDENCE ADDRESS: 

(A) ADDRESEE: LAHIVE & COCKFIELD 

(B) STREET: 60 State Street. Suite 510 

(C) CITY: Boston 

<D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP : 02109-1875 



(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Mandragouras , Amy E. 

(B) REGISTRATION NUMBER: 36,2 07 

(C) REFERENCE/DOCKET NUMBER: GTN-001CPPC 



(x) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617)227-7400 

(B) TELEFAX: (617)227-5941 



SUBSTITUTE SHEET (RULE 26) 



WO 97/19098 



PCT/US96/18542 



76 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 
'(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(iil MOLECULE TYPE: DNA (genomic) 



<iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1 . . .519 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 



ATGAAAGGYC CYATCCTATG GCCGGCGTTT TCTCAATTTA GCGATCAAGA TTTGAGCGAT 60 

ATTGTGGCGT ATCTCACTTC TATTTTGCCT AAAAATTTGA GCGATAAGGA AGTGTTCGCG 120 

CAAAGTTGTC AAAGGTGCCA TAGCCTGGAT TATGCTAAAG ATAAGGCCTT TAGCGATCCT 180 

AAAGATTTAG CCAATTATTT AGGCTCTCAT GCGCCTGATT TGTCCATGAT GATTAGGGCT 240 

AAGGGCGAAC ATGGCTTGAA TGTTTTCATC AACGATCCGC AAAAGCTTTT GCCTGGCACA 300 

GCCATGCCTA GAGTGGGATT GAATGAAAAA GCTCAAAAAC AAGTCATTTC TTATTTGGAA 3 60 

AAAGCGGGCG ATAGGAAAAA GCATGAAAGG AATACTTTAG GGATTAAGAT CATGATTTTC 420 

TTTGCGGTGC TGTCGTTCTT GGCTTACGCT GGAAAAGAAA AGTTTGGAGC GAAGTGCATT 480 

AAATTTAAAA AAGGGGGGAC ATGGTTTTAT GATTTTTAA 519 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION 1. . .186 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 2 



ATGCAAGAAT TCAGTTTGTG GTGCGATTTT ATAGAAAGGG ATTTTTTAGA AAACGATTTT 60 

TTAAAGCTCA TCAATAAGGG GGCTATTTGC GGGRCGACGA GTAACCCTAG TTTGTTTTGC 120 

GAAGCGATCA CAAAAAGCGC GTTTTATCAA GATGAAATCG CTAAAMCTCA AAGGCAAAAA 180 

AGCTAA 186 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

<ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION 1. . .861 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

TTGAASCCGA TGAAAGTGAT TCAAGTTTTT TTATTTTCCA ACCCTTTTTG CGCGATTGTG 60 

CCTAACACGG AGCCAGAACA TTTGGAGCAT TATGACCACC ATTTAGAACC CTTTTTCTTC 120 

GCTTATAAAT ATTTTTTAGA CCATGCTCAA AAAAGAGTGA TCTATAAAGA AGATCCTTTT 180 

TTAAAAAACT ATTCTAAAGA CGCCATTGTT TTAGAAAAAA AAGACATTTA TAATATCCAA 240 

TACATTTTAA AAGACGGAGA GCCTTACACT TCGTTTGAAT TGAAAAATTT GGGGGCTTTT 300 

TTGGTGTGGG GGTTAGGCGA ACATAACGCC ACGAATGCGA GTTTGGCGAT TTTAAGCGCT 360 

TTAGATGAAT TAAATTTAGA AGAAATTAGA AATAATTYAT TGAATTTTAA AGGCATTAAA 420 

AAACGCTTTG ATATTTTGCA AAAAAACAAT CTCATTCTCA TTGATGATTA CGCCCACCAC 480 

CCTACTGAAA TTGGCRCCAC TTTAAAAAGC GCTAGGATTT ATGCCAATTT ATTGAATACG 540 

CAAGAAAAAA TTATAGTGAT CTGGCAAGCG CACAAATACT CTCGCTTAAT GGACAATTTA 600 

GAAGAATTTA AAAAATGTTT TTTAGAGCAT TGCGACAGGT TGATCATTTT ACCCGTTTAT 660 

AGCGCGAGTG AAGTTAAAAG AGACATTGAT TTGAAAGCCC ATTTTAAGCA TTATAACCCC 720 

ACCTTTATAG ACAGGGTGCG TAAAAAGGGG GATTTTTTAG AGCTGTTAGT CAATGATAAT 780 

GTGGTAGAAA CGATTGAAAA AGGCTTTGTG ATAGGCTTTG GAGCGGGGGA TATTACCTAT 840 

CAGTTGAGAG GCGAAATGTA A 861 



(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 186 base pairs 

(B) TYPE: nucleic acid 
<C} STRANDEDNESS : double 
(DJ TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION 1. . .186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

TTGTTGCTTT TCTTTCTTTT GAARGGCGTC GTTTTTTCTT TGGGCTTTTT TTCCTTTTTT 60 

GAAGAAGTCT CTGGCTCTTT TGRAGCTGTT TCTTTGARCG TGTTAGCGTT AGTCATGGGG 12 0 

TCTAGTYCTG GGTTAGAAGA ATTCTGTGTC TTAGAAGAGC TTATAAATTC AGGGCTATCA 180 

GTATAG 186 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .369 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

ATGGGTTTTT TAAAAGTTTT AAAACATGAC GCTTTAGGGC AAGTAGGGAA TATTGTTATA 60 
GGGAATTTCT TAATAACGCT CACTGTTTTA GCGGTTTGTT TTTCCTCTCA AAGCGCTGAA 120 
GAAACGACCA TGCTCACCCT AAGCTACACG CTCTTTTTTA TTTTAGGGGC GTTTTTATTA 180 

TTAAGCTTTT CCTTACCCAT TAGTTTGGAA TCTTTATTGC TCCCTAAAAT CTTGCTCCCC 300 
AKGGTGTTTT TTTATCTTCA GTTTGTTCTG GTTTGTGGCG AGCGTGCGTT TGGGCTATTA 360 
CCTTTTTAA 369 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1. . . 564 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 

ATGTTAAAAA CCCACTTAAG CAGCGCTAGG GGCGTTGTGG TGTTGTCTAA GATTTTACCG 60 

GTGAACGTGG TGTTAATGGT GAGCGTGCGC TTGTTTGAAA AGGAATTAAA ACGCAAACCT 120 

TACTACATCA TTGCGAGCGC ACACAGCGAT GAAGGTTTAG AAAAATTAAA AAAATTWGGG 180 

GYTGATATGG TGGKTTYCCC TACAAAACTY ATGGCGCAGA GAGTGAGCGC GAATKGMTGG 240 

TGYKTCCTGG ATATGGAAAA TATCTTAGAG CGTTTTATCA ACAAAAAAGA CACGCTTTTA 300 

GACTTAGAGG AAGTGATTGT CCCCAAAACC AGCTGGCTTG TGTTAAGGAA ATTAAAAGAA 360 

GCCCATTTTA GAGAGATCGC TAAAGCCTTT GTGATTGGTA TCACTCAAAA AGATGGCAAA 420 

TACATCCCCA TGCCTGACGG GGAAACGATT ATTGCAAGCG AATCCAAGCT ATTGATGGTT 480 

GGCACTTCAG AAGGCGTTGC GACCTGTAAG CAACTCATTA CTAGCCACCA AAAACCAAAA 54 0 

GAAGTGGATT ACATTTCATT GTGA 564 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .582 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 

GTGGGGAGCT TTTTATTCGT GGGGCCTAGT GGGGTAGGGA AAACAGAATT GGCTAAAGAA 60 

TTGGCCTTGA ATTTGMATTT GCATTTTGAA CGCTTTGACA TGAGCGAATA CAAAGAAGCC 120 

CATAGCGTGG CAAAGCTCAT CGGAAGTCCT AGCGGTTATG TGGGGTTTGA GCAAGGGGGG 180 

TTATTGGTGA ATGCGATTAA AAAGCACCCG CATTGTTTGC TGCTTTTAGA TGAGATAGAA 240 

AAGGCCCACC CTAATGTGTA TGATTTGTTG TTGCAGGTGA TGGAKAACGC CACTTTGAGC 300 

GATAATTTAG GCAACAAGGC GAGTTTTAAG CATGTGATAC TGATTATGAC KKCAARTGTG 360 

GGGAGTAAGG ATAAGGACAC GCTAGGGTTT TTTAGCACTA AAAACGCCAA GTATGATAGA 420 

GCCGTTAAAG AGCTTTTAAC CCCTGAATTC CCATCCACAA TTCATGCCAT CCTCCCCTTT 490 

AACGCGCTCA GTTTGGAGGA TTTTGAAACG CATTGTTTCT GTGGAATTGG ACGGGTTAAA 540 

AGCCCTAGCA CTAGAGCAAG GCGTGATCTT AAAATTCCAT AA 582 

(2) INFORMATION FOR SEQ ID NO : 8 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 909 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iiij HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1 . . .909 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 



ATGGCTTTTC 
ACTCAAAACG 
GCGGCTGATG 
TTGGGTCAAG 
GCTATGGATG 
CAAGATGGGC 
CAAGGTTTGG 
CAATTCACTA 
ATCGGCTCTA 
ACGGCTTCTG 
TTAGAGAGCG 
ATCAATAAAA 
GATGTGGCGG 
AATATCGCAG 
GTCACTTCAG 
AGTATAGGT 



AGGTCAATAC 
CACTTAAAAC 
ACGCATCAGG 
CGATTGCCAA 
AGCAATTAAA 
AAACTACGGA 
ATAATATCGG 
ACAAAGAATT 
CCACTTCCGA 
GGGATATTAG 
TAAAAGTTTC 
ACTCTAACCG 
TCCAGTCAGG 
ATATTAAGMR 
AAACCGGTGT 



AAATATCAAT 
TTCATTGGAG 
CATGACGGTG 
CACGAATGAC 
AATCTTAGAC 
ATCTCGTAAA 
TAACACAACG 
CCAAGTAGGG 
TAAAATCGGT 
CTTGACTTTT 
TAGTTCAGCA 
AACAGGGGTT 
AAGTTTGAGT 
AAACGACTCA 
GGWAGCTTAT 



GCGATGAATG 
CGATTGAGTT 
GCGGATTCTT 
GGCATGGGGA 
ACCGTTAAGG 
GCGATTCAAT 
ACTTATAACG 
GCTTATTCTA 
CAGGTTCGTA 
AAACAAGTGG 
GGCACAGGGA 
AAAGCTTATG 
AATTTAACCT 
GACGGAAGGT 
ACGGATCAAA 



CGCATGTGCA 
CAGGTTTAAG 
TGCGTTCGCA 
TTATCCAGGT 
TTAAAGCGAC 
CTGACATCGT 
GGCAAGCGTT 
ACCAAAGCAT 
TCGCTACAGG 
ATGGCGTGAA 
TCGGCGTGTT 
CGAGCGTTAT 
TAAATGGGAT 
TAGTCACAGC 
AAGGACGCTT 



ATCCGCACTC 
GATCAATAAA 
AGCGAGCAGT 
TGCGGATAAG 
TCAAGCGGCT 
TCGTTTGATT 
ATTGTCTGGT 
TAAGGCTTCT 
CGCGTTAATC 
TGATGTAACT 
AGCAGAAGTG 
CACCACGAGC 
TCATTTGGGT 
RATCAATGCG 
GAATTTGCGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
909 



(2) INFORMATION FOR SEQ ID NO : 9 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

iix) FEATURE: 

(A) NAME/ KEY : misc_f eature 
(8) LOCATION 1. . .486 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 

ATGTTTTTTA AAACTTATCA AAAATTACTG GGCGCGAGCT GTTTGGCGCT GTATTTAGTG 60 

GGCTGTGGGA ATGGTGGTGG CGGTGAATCG CCGGTTGAGA TGATTRCAAA TAGCGAGGGT 120 

ACGTTTCAAA TCCACTCCAA ACCACATACC ATTACTATTC AACCCCTCAA CCTTAATACA ISO 

GGTAATTGTG CTGTCAATTT TGTTCCAGTA AGTGAGACGT TTCAAATGGG TGTTTTAAGT 240 

CAAGTTACTC CAATCTCTAT ACAGGATTTT AAAGATATGG CAAGCACTTA TAAGATATTT 3 00 

GATCAAAAGA AAGGGTTGGC AAACATAGCA AATAAAATTT CTCAATTAGA GCAAAAGGGT 3 60 

GTGATGATGA AACCTCMAAC CCTTAATTTT GGAGAAAGTT TAAAAGGCAT TTCTCAAGGG 420 

TGCAATATTA TAGAGGCAGA AATACAAACC GACAAAGGCG CTTGGACTTT TAACTTTGAT 4 80 

AAATAA 486 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_£ eature 

(B) LOCATION 1. . .276 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

GTGTGTTTAG GGCTAGCTGA TGTGATGGTG GTTTTAAGCT TGCACCTCAA CCTAAACCCC 60 

ACCAACCCTA AATGGCTCAA TAGGGACAGG TTGGTTTTTA GCGGAGGGCA TGCGAGCGCG 120 

TTAGTGTATA GTTTGTTGCA TTTGTGGGGC TTTGATTTGA GTTTAGACGA TTTAAAGCGT 180 

TTCAGGCAAT TACACTCTAA AACCCCAGGA CACCCTGAAT TACACCACAC CGAAGGCATT 240 

GAAATCACCA CASCCTTTAG GGCAAGGTTT TGCTAA 27 6 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1. . .561 

(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 11 

ATGACAACAC CGATGATTAT TATTTCCCTA GAAATGGGGT TATCTTTAGT TCCTATGCGA 60 

CAATGTCTGG TTTGCCAAGC TCTGGCACGC TCAATTTCTT GGAACGGGTT AGGCGGGAAT 120 

GTCCGTAACA CCAAAGTTTA TGGTAAATTC GCCGCTTACC ACCATTTGCA AAAATATTTA 180 

TTGATAGATT TGATCGCTCG TTTTAAAACG CAAGGGGGCT ATATCTTTAG GTATAACACC 240 

GATGATTACT TGCCCTTAAA CTCCACTTTC TACATGGGGG GSGTAACCAC GGTGAGAGGC 300 

TTTAGGAACG GCTCAATCAC ACCTAAAGAT GAGTTTGGCT TGTGGCTTGG AGGCGATGGG 360 

ATTTTTACCR CTTCTACTGA ATTGAGCTAT GGGGTGTTAA AAGCGGCTAA AATGCGTTTA 420 

GCGTGGTTTT TTGACTTTGG TTTCTTAACC TTTAAMACCC CAACTAGGGG GAGTTTCTTC 480 

TATAACGCTY CCACCACGAC GGCGAATTTT AAAGATTATG RCGTTGTAGG GRCTGRGTTT 540 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION 1. . .315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

ATGCAAGCGT TAAAATCATT GCTTGAAGTG ATTACAAAAC TCCAGAATCT AGGCGGCTAT 60 

TTGATGCATA TAGCTATTTT CATCATTTTT ATTTGGATTG GAGGRCTTAA GTTTGTGCCT 120 

TACGAAGCTG AAGGGATCGC CCCTTTTGTG RCCAACTCCC CTTTCTTTTC TTTCATGTAT 180 

AAATTTGAAA AACCTGCATA CAAACAACAC AAAATGTCTG AATCCCAATC CATGCAAGAA 240 

GAAATGCAAG ATAACCCTAA AATCGTTGAA AACAAAKAAT GGCATAAAGA AAACCGCACT 300 

TCATTTAGTG GCTGA 315 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .549 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

ATGCAGTTTG AAGAAATGAA AGAATTAGCC CATCAAATTG GCGTGTTTTA CCATGTTGGC 60 

GTTGATGGCA TCGCGCTCTT TTTGTTGCTC TTAAACGCTA TCGTGGTGTT ATTGTCCGTG 120 

GTTTATGTCA AAGAGCGTCG TAAAGACTTT GTGATTTGTC TTTTATTGTT AGAMGGGATC 180 

TTAATGGGCG TGTTTTCTTC TCTTAATGTG ATCTTTTTCT ACGCTTTTTG GGAAATCTCG 240 

CTCTTGCCGG TTTTATACCT CATCGGTCGT TTTGGCCGTA ATAACAAAAT CTATTCTGGC 3 00 

ATGAAGTTTT TCCTCTACAC CTTTTTAGCG TCGTTGTGCA TGCTTTTAGG CATTTTATAC 360 

ATCGGGTATG ACTACGCCAA TAATTACGGC ATGATGAGTT TTGATATTTT AGACTGGTAT 420 

CAGTTGAATT TTTCTAGCGG GATTAAAACC TGGCTCTTTG TAGCTTTCTT AATAGGGATT 480 

GCGGTTAAAA TCCCGCTCTT TCCCTTCACA CATGGCTGCC TTATGCGTAT TCTAACGCCC 540 

CCACTCTAG 549 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 
<B> LOCATION 1. . .351 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

GTGAAAAAAT ACGCTGAAGA TTTTATCACC AAAGATGAAG TGAAATCCCT TTTAGAGCGC 

TTGGCTAAAG ACTATCCTAC GATTGTAGAA GAGAGTAAAA AAATCCCCAC CGGTGCGATC 

CGCTCAGTCT TGCAAGCCTT GTTGCATGAA AAAATCCCCA TTAAGGACAT GCTCACTATT 

TTGGAAACGA TTACTGATAT TGCTCCATTG GTTCAAAACG ATGTGAATAT CTTAACCGAA 

CAAGTGAGGG CGAGGCTTTC YAGGGTGATC ACCAACGCTT TTAAATCTGA AGACGGGCGT 

TTGAAATTTT TAACCTTTTC TACCGATRGC GAACAATTTT TKGCTCAATA A 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .720 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

ATGATGAAAA ACAAACGCTC TCAAAATAGC CCTTATGTAA CGCCTGACAA CCCTTATCTA 60 
ACGCTAGAAA AAGCTTTAGG GTATTCTTTT AAAGACAAGC GTTTATTGGA GCAAGCCTTA 120 
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ACGCATAAA7 CATGTAAGCT CGCTTTAAAC AATGAGCGCT TGGAATTTTT GGGCGATGCG 180 

GTGTTGGGCT TGGTGATAGG GGAGCTGCTA TACCATAAAT TCTRTCAWTR CGATGGGGGC 240 

AAACTCTCTA AATTAAGGGC TTCTATTGTG AGCGCGCATG GTTTCACTAA ATTAGCGAAA 300 

GCGATTGCTT TACAAGATTA TTTGCGCGTT TCTTCTTCTG AAGAAATTTC TAAGGGGAGG 360 

GAAAAACCC? CTATTCTRTC AAGCGCTTTT GAGGCTTTAA TGGCTGGGGT GTATTTAGAA 42 0 

GCAGGGTTAG CTAAGGTGCG TAAAATCATA CAAAATTTAC TCAATCGTGC TTACAAGCGT 480 

TTGGATTTGG AGCATTTGTT TATGGATTAT AAAACCGCTT TGCAGGAATT GACCCAAKCK 540 

CAGTTTTGCG TGATCCCCAC TTACCAATTA CTCCAAGAAA AAGGCCCCGA TCACCATAAA 600 

GAATTTGAAA TGGCTCTATA CATTCAAGAT AAAATGTATG CGACCGCTAA AGGCAAGAGT 660 

AAAAAAGAAG CCGAACAGCA ATGCGCTTAT CAAGCGCTTC AAAACTTAAG GAAGCCAAAT 720 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 687 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .687 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

TTGCTGGTCT TACTCAATCT AAAGAWTACG CCGAATTTGA TGTGGCCTTT AGATATTATT 60 

GTGGTTGTGG CATGGGTGTT ATGGGGGGTT AATATGTTTG GGAGCATGAG CGTTAGAAGA 120 

GAGAATACTA TTTATGTGTC TTTGTGGTAT TACATCGCTA CTTATGTGGG TATAGCGGTG 180 

ATGTATATCT TCAATAACCT TTCTATCCCC ACCTATTTTG TCGCTGATAT GGGGAGCGTT 240 

TGGCATTMTA TTTCTATGTA TTCAGGCAGT AATGATGCGC TCATTCAATG GTGGTGGGGG 300 

CATAATGCGG TCGCTTTTGT CTTTACGAGT GGGGTGATTG GCACAATTTA TTATTTCTTG 360 

CCTAAAGAGA GCGGCCAGCC TATCTTTTCT TACAAACTCA CTTTGTTTTC TTTTTGGAGT 420 

TTGATGTTTG TTTATATTTG GGCGGGCGGG CACCATTTGA TCTATTCCAC CGTGSCTGAT 480 

KGRGTACAAA CCCTTTCTAG CGYGTTTTCA GTGGTGTTGA TCTTGCCTTC GYGGGGGACA 540 

GCGATTAACA TGCTTTTAMC GATGAGAGGC CAATGGCACC AGYTCAAAGA AAGCCCTTTG 600 

ATCAAGTTTT TAGTTTTAGC CTCAACTTTC TACATGCTTT CCACGCTAGA AGGCTCCATT 660 

CAAGCCATTA AAAGCGTGAA CGCTTAG 687 



(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 489 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .489 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
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ATGAAAGCAC CCTCCCAAYA GGATTTAAAA AAAATCTTAG GGATTGAAGA AGTCATAATS 60 

STATCCACAA GCCCCATGGA ATTACGATTG GCCAATCAAA AGCTAGGCAA TCGTTTCATT 12 0 

AAAACCTTAC AAGCCATGAA CGAATTAGAC ATGGGTGCAT TTTTTAACGC TTACGCTCAA 180 

ACAACCAAAG ATCCCACCCA TGCCACTAGC TATGGCGTTT TTGCGGCGAG TTTGAATATG 240 

GAATTGAAAA AGGCTTTAAG GCATTATCTT TATGCGCAAA CTTCTAACAT GGTGATCAAC 300 

TGCGTTAAAA GCGTCCCCTT ATCCCAAAAC GACGGGCAAA AAATCTTATT GAGCTTGCAA 360 

AGCCCTTTTA ACCAGCTCAT AGAAAAAACC CTAGAACTAG ACGAAAGCCA CTTGTGCGCA 42 0 

GCAAGCGTTC AAAACGACAT TAAGGCGATG CAGCATGAGA GTTTATACTC GCGCCTTTAT 480 

ATGTCTTGA 4 89 



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 
{ B ) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
\u) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .180 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 



ATGGCGTTTA TTCTAACCAC AAACCTATTT ATCAAGAGTT TTACGAACTC AATTCGCATA 60 
ACGGGTTGTA TTATCAGCCC TAATGTGTTT TTTGCTTATG AATTTTGCGC GTTAGGGTTT 120 
AGAAAAGGGG GGTTAATTTT GGATAATTTT TCTAAATTCG TGAGCCACAG GTTGCAATAA 180 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc^f eature 

(B) LOCATION 1. . .747 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 



GTGCGTTTTT 
GATAGCGCTT 
GTAACCACCC 
TTAGTGATGA 
GGCACGCAAC 
TTTATCATGG 
AAAAAGATTT 
CTTAAAAACA 



TCATTTTTTT 
TACCTAGCGT 
TTAATGTCAT 
CGAGTTTCAC 
AAACCCCACC 
AACCTAGCTT 
CTTACACCGA 
CACGAGAAAA 



AATTCTCATT 
CAATCTCTCT 
CGCCTTACTC 
CCGTTTGATC 
CACTCAAATT 
GAAAAAGGCT 
AGCGTTTGAA 
AGATCTAGCC 



TGCCCTTTAA 
TTAAACGCTC 
ACGCTTTTGG 
GTGGTGTTTT 
CTAGTCTCGC 
TATGATACAG 
AAAAGCACTC 
CTTTTTTTTA 



TATGCCCCTT 
CTAGTGATCC 
TTTTAGCCCC 
CTTTTTTAAG 
TCTCTTTGAT 
GGATTAAGCC 
TGCCTTTCAA 
GGATTAGGAA 



AATGAGCGCG 
TAAACAACTC 
ATCGTTGATT 
GACCGCTTTA 
ATTGACTTTT 
TTATATGGAT 
GGAATTCATG 
TTTGCCTAAC 
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CCTAAAACCC CTGATGATGT GAGCTTGAGC GTTTTAATCC CGGCATTTAT GATAAGCGAG 540 

TTGAAAACAG CGTTTCAAAT CGGCTTTTTA CTCTACTTGC CTTTTTTGGT GATTGATATG 600 

GTTATCAGCT CTATTTTAAT GGCGATGGGT ATGATGATGC TCCCGCCTGT AATGATTTCT 660 

CTGCCTTTTA AAATTTTGGT GTTTATTCTG GTGGATGGGT TTAATTTATT GACCGAAAAT 720 

TTAGTGGCGA GTTTTAAAAT GGTTTAA 747 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 501 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

{iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION 1. . .501 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

TTGTTGGTTA CTTTTTTGAA TGGGTTTGAC CCAAAAATCG CTAATTTAAG GAAAGCGTGC 60 

AATGTTTATA GYGTGGGGGT GATTTATATT GTAACCACCA ACACGCTCAA TATTTTAAGT 120 

TGTGAGAGTT TTGAAATTTT AGAAAAAAGA GAGCTGGATA CAAGCGGCGT TACTAAAACT 180 

TCCACGCCGT TTTTTTCTAG GGTTGAGGGC ATTGATGCAG GCACGCTAGG GAAACTTTTT 240 

TCAGGCAGTC AATCTAAAAA TTACTTCGCT TACTATGACG CTTTAGTGAA AAAAGAAAAA 300 

CGAAAAGAAG TAAGGATTGA AAAGAAAGAA GAAAGGATTG ATGCTAGAGA AAATAAACGA 3 60 

GAAATCAAGC AAGAAGCCAT TAAAGAGCCT AAAAAAGCCA ATCAAGGCAC AGAAAACGCT 420 

CCCACTTTAG AAGAGAAAAS CTACCAAAAR GCAGAGCGAA AATTTGACGC TAAAGRAGRA 480 

AGGMGATCGT TCAAGRGATG A 501 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .381 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

ATGGAAAACA GCACACTTTA TATTGTTATT GCCGGCTTAT GGCTTGCTGT AGGCTTTGGA 60 

ATCTTTTTAA AGAAATTAGA CATGCCCGTT ATCATTGGCT ACATTTGCAC AGGAACGGTC 120 

TTAGCGGCTT TTTTTAAAAT TAATGATTTT AATTTGTTGT CTGATATTGG TGAATTTGGT 180 

ATCGTCTTTT TAATGTTTAT GATAGGCATT GAGTTTAATT TTGACAAGCT CAAGTCCATC 240 

AAACAAGAAG TGCTCGTTTT TGGGCTTTTA CAGGTTGTTT TATGCGCTTT AATCGCTTTT 300 

TTATTGGGGT ATTTTGTTCT GGGTCTTTCG CCCATTTTTT CCCTTGTTTT AGGCATGGGG 3 60 

CTTTCACTCT CTTCAACCGC C 381 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .51 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
TTGTTACTCA TGCTTAATAA GCCAAAGCCT TTATTTTTGM CTCTTGGTTA A 
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1053 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .1053 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 



ATGGCATTAA 
CAAGAATTAT 
GATAACGTGT 
AAAGAGAGCG 
AAGATTAGCA 
AGCTATGGGG 
AATATCGCTA 
AAGCATGAAA 
CAAAAACACC 
GAGATTTATC 
ACCACRATCG 
GTTAAAAACC 
TTTGCGTGGA 
ACCGTGAATA 
TCTTATTTAG 
GCGATTKCGA 
GGGAGCGTGC 
GATGTGTTGG 



GGGTATTATT 
CATCTATACA 
GGTTGAAAAA 
TGAGGCAGGA 
CCTTAGAGCA 
TCAATCCTTT 
ACCCTATTGC 
ATCTTAAAAA 
AGCTTTTAAA 
AAACTCAAGC 
GGATTTTCCA 
AGCTTTTTAA 
TTTTAAAAAT 
AAGCCATTAA 
AAAACGTTAC 
TGAAGGATTT 
ATGTGGGCGA 
ATACTTCTAA 



ATTCTTTTGT 
AAAACAAATG 
ATTTGAAAAC 
ATTAAGGCGC 
TACCTTAAAG 
TAAGGACTTG 
GATCATTGAT 
TAACCAGACT 
TCAGTGGCAC 
CAAACGCTTA 
AAAAGACAGC 
ATTGGTTTAT 
CATTTCCAGT 
CTTCGTGAAT 
TTACTTGGTA 
ATTCATGAGC 
TAGGGTGCGT 
TGTTGTACAT 



TTTTTGTTTT 
GCTTTGGTGG 
TATAAAATTT 
TTAAAAAACA 
GCTTTAGAGT 
ATAGAGCGCC 
GGCATTTCTT 
TCTTTAGGAG 
GCTTTGGATA 
GAATTGCAAG 
GATGAAGCTA 
GTGTTTTTAG 
AAATACATTG 
GTGAGCGTGA 
ACGGTTTTAG 
TTGCTCGGGT 
ATCGCTAAGG 
TAA 



TGCAAGCAGA 
ATAAAAAACT 
ACAACCAAAT 
AAAAAAGCAA 
CCCAGCAAAA 
CCAATATCCC 
TCATCAAGAG 
AAGTTTTAAA 
AAAGCGCGAA 
GGGCTCAAAA 
TAAGCATTGT 
CGGCCCTTTT 
AAAATAATGA 
TCGKTKKAAT 
GCTTTGCGAG 
GGTTTATCAT 
GGACGGATAT 



AGATAAGAGC 
CGCCAAAGAC 
TTATACTGAA 
GGATTTATTA 
AATG TTTGAA 
CAATATCCCT 
CATGCGTTTA 
GCTTTTAGAT 
ATTAAGCGAT 
CATTCTAAAA 
CAAATCTCAA 
GAGCGTGGTG 
GCGCGTCTAT 
CTTKCTTTTT 
CGCGGGCTTA 
TTTGATTGGG 
TTTTATTGGC 
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(2) INFORMATION FOR SEQ ID NO:24: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(ivi ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAjrfE/KEY: misc_feaLure 
(6) LOCATION 1 . . .300 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

ATGAAAGAAC AAGAATGGGA TTTAAGCGCT TTATTTGAAA ATAAAGAAAG CGCAGAAGAA 
TTTTTAAAAA CCTTACAAAC AGAAGTGCAA GAATTTGAGA ACGCTTATCA AAATAACCTT 
AAGAA^TTAG ACGCTGCAAA ATTTGCCAAC ACTCTTAAAC ATTACGAAAA TTTGTCAGAA 
AAGATCTCTA GAGCGATGGC TTACGCCAAT TACTTTTTGC CAAGAACACT AAAGAAGCGA 
AGTTTTATTC GCAATGCAAA TGGCTTGTGC AAATATCCAA CAACACCTTT TATTCTTTGA 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 base pairs 

(B) TYPE: nucleic acid 

{ C J STRANDEDNESS : doubl e 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

<iv> ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc„feature 

(B) LOCATION 1. . .237 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

TTGCGCGTGG GCATGTATGA AGTGTGTAAC CATAAAGACG GCACCGCTTA TCATTCCACA 
AGAGGTTCTA AGGTTACCTT AGCGTGTAAA ACCGGCACCG CGCAAGTCGT AGAAATCGCT 
CAAAACATCG TCAATCGCAT GAAAGAAAAG GATATGGAAT ATTTCCATCS MTCCCATRCG 
TGGATTACGR CATATCTTTR CCCTATGAAA AACCCAAATA CGCTATCACT ATTTTAG 

(2) I INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

<ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .159 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

TTGGGTTTGG TGWRCGGRAT TTCTCTCTTG CATTTGAGTT TGGAGCAAAA AATCAGCGTG 
TTTCTTGGRC YCAATTTAAT GCTTTAYCCG GTCAYAGAGG TGCTTTTTAG TATCCTTAGG 
CGCAAAATAA AACGCCAGAA AGCCACCCAT GCCGGATAA 

(2) INFORMATION FOR SEQ ID NO: 27: 



60 
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<i) SEQUENCE CHARACTERISTICS: 

lAf LENGTH: 1134 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

<ix) FEATURE: 

(A) NAME /KEY : misc_f eature 
(BJ LOCATION 1. . .1134 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 



TTGGCTCAAC 
CAATTTGCCT 
TATATGGATA 
AAGATTGAGC 
AAAACAAATG 
AAAAACTTAG 
CTAGCAGATT 
CTAGATTATC 
GGGCTTGTCG 
TTTGACAATA 
TCAAGTTTCT 
ATCCAAGAAG 
TTAGGCGATC 
AACGCTATTG 
GATACGATTA 
TCTGATTACC 
CTGGATAGGA 
TTGAGAGGCA 
GAAATTGACA 



CGGTTCAGGT 
ATAATTTTCC 
AAGATTATGG 
TAGGGACAGA 
CAACAATGGA 
CAACTAAAAG 
CTAATACGCT 
CTATCACTCA 
CGCGATTAGC 
AAGCAACAGA 
TAGACAATAA 
CAATGGATGG 
CAAAGGTCGC 
TTAGACTTGC 
GAGAACAATG 
AAAGATTAGC 
AAATCCTCTA 
TTCCTAAAGA 
AGATTATCCA 



AAGAACAGTG 
TGCTAATATC 
CGCTTATGGG 
TACAGGATTA 
GCAAAAACAA 
CGATGAAAAA 
TGCAGCGGCA 
ACTTATTAAT 
GCCTTTTTGC 
TCGCTTAGAT 
TGACGTTTCG 
GCGTAGATTT 
TTATTTTGTG 
GACTCAAAGC 
CCCTACAAAG 
CAATGTTACA 
CAAACAGGAT 
ATATTTGAAA 
AAACCATAGT 



TTTATGTCCA 
AGCAAAGACA 
AATATTGTTG 
AATCCTTTTG 
ACAGCTATTT 
GATGAAAATG 
GTAACCAACC 
GCTTTCGGGA 
AAATCAACCA 
TTTTCAAAAA 
CCTTTTATTT 
GTCTTAGATA 
AGAGACATGC 
ATCACTGATC 
ATTTTTTTGA 
GAAAAAGAAT 
GGAAGCCCTA 
ATTTTATCCA 
ATCATAGATA 



TGACTCTAAA 
AGCAAAAGCT 
CAATGGGTGG 
CTTGGGCAGC 
CTGTTGTCAA 
GCAACAGCAT 
TTATCACAGG 
AAGACCACAA 
ATGGTGAATT 
CGATTATTGG 
GTTTTTACCT 
TTGATGAAGC 
TAAAAACTGC 
TTTTGGCTTG 
GAAACGATGG 
TTGAAATCAT 
GCGTTATCGC 
CAGATACTGT 
AATATCAGCC 



CGCTATGGGG 
CACTATGGTT 
GGAGTATGTC 
TTGTGTGCAA 
AGAGCTTGTG 
CTCTTTTAGC 
AGATATGAAC 
TGATCCTAAT 
TCAATGGCTT 
CGTTGATGGG 
TTTCGCTCGT 
GTGGAAATAT 
AAGGAAAAGA 
CCCTATTGCT 
GGGTAATCTT 
CACTAAGGGG 
TAGTTTTAAT 
ATTTGTCAAA 
TTGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1134 



(2) INFORMATION FOR SEQ ID NO:28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .465 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

ATGAAATTGG TGAGTCTTAT TGTAGCGTTA GTTTTTTGTT GTTTTTTAGG GGCTGTAGAG 
TTGCCTGGAG TTTATCAAAC TCAAGAATTT TTATACATGA AAAGCTCTTT TGTGGAGTTT 
TTTGAGCATA ACGGGAAGTT CTATGCCTAT GGTATTTCTG ATGTGRATGR CTCTAAAGCC 
AAAAAAGACA AACTCAATCC TAACCCAAAG CTAAGGAATC GCAGCGATAA AGGCGTGGTG 
TTTTTAAGCG ATTTGATTAA GGTTGGGGAA CAATCTTATA AAGGCGGTAA GGCGTRTAAT 
TTTTRTGACG GCAAGACCTM CCATGTGAGA GTCACTCAAA RTTCAAACGG GGATTTGRAA 
TTCACTTCAA GCTATGKCAA ATGGGGGTAT GTGGGCAAAA CCTTCACCTG GAAACGCC7G 
AGCGATGAAG AAATCAAAAA TCTAAAGCTC AAGCGTTTTA ACTGA 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 
(8) LOCATION 1. . .24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29 

TTGGAGACCC TATTCTTGGT ATAG 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .345 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
ATGSACACAC ACGACAGGCG CAAGTTAAGA ATTARCCTTA CACAAACGAC GACTTTAGTG 



60 
120 
180 
240 
300 
360 
420 
465 



SUBSTITUTE SHEET (RULE 26) 



WO 97/19098 



PCT/US96/18542 



90 



GCCACTATTG GCTCAAACGC CCCTTATATC GGTCTTTTAG GGACGGTTAT GGGGATCATG 120 

CTCACCTTTA TGGATTTAGG CTCAGCTTCT GGCATTGACA CTAAGGCGAT CATGACTAAT 180 

TTAGCCCTTG CTTTAAAAGC GACCGGCATG GGGTTATTGG TAGCGATCCC TGCGATTGTG 24 0 

ATTTATAACT TGTTAGTGAG AAAAAGCGAG ATTTTAGTTA CCAAATGGGA TATTTTCCAC 300 

CATCCGGTTG ATACGCAATC CCATGAGGTT TATAGCAAAG CCTAA 3 45 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1 . . .204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 



ATGCAGGATT TAGACAATAA CATGTCCTTA GACACCGCTC ACAACACGCT GAGTTCTAAC 60 

GGGAAAAACA TCACCATTGC CGGGGTGGTA AAAGCCTTAC AAAAAATTGG CGTGAGCGCT 120 

AAGGGGATGG TTTCAATCTT GCAAGCCCTA AAAAAAAGCG GCGCGATTAG CGCGAAATGG 180 

AGATACTATG ATAAACAACA ATAA 204 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1 . . .267 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 



TTGCACCCTT TAGCGGATGT CTTTGTGGTG AATGACAAAC GGYCCGTTTT AGCGATGGTA 60 

RCGATGTTGA TTGRCTCGTT AGCGAATATC TTTTTCAATT ACTTGTTTAT TTTTGKGTTG 120 

GAAGTGGGGG TTCAAGGCAG MGCGATAGTC ACCGTGATAG GGCATGCGAT AGGGGGTTTA 180 

GTCTTAATGC AGCATTTTTG GCGCAAAAAA GGGGAGTTGT ATTTTATCAA ACTGATTTTC 240 

TTTATCTTCA GTCATTTCTT CAGCTAA 267 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

ATGT'l'AAGGA AAAACATTTT AGCTTACTAT GGGGCGAATT TTCTCTTAAT CATCGCTCAA 60 

AGCTTGCCCC ATGCGATTTT AACCCCCTTG TTGCTTTCTA AAGGGCTTAG TTTGAGTGAA 120 

ATCTTGCTCG TGCAAACCTT TTTTAGTTTT TGCGTGCTGG TGGCTGAATA CCCAAGCGGC 180 

GTTTTAGCGG ATTTGATGAG CCGGAAGAAT TTATTCCTGG TTTCTAATGT GTTTTTAATC 240 

GCTAGTTTTT CGTTGGTGCT GTTTTTTGAT AGTTTTATCC TCATGCTTTT AGCGTGGGGG 3 00 

TTGTATGGTT TGTATAGCGC ATGCTCTAGC GGCACGATTG AAGCTTCACT CATCACAGAC 3 60 

ATTAAGGAAA ACAAAAAAGA TTTATCCAAG TTTTTAGCCA AAAACAATCA AATTACTTAT 420 

TTGGGCATGA TTATAGGGAG TTCTTTGGGA TCGTTTTTGT ATCTCAAAGT CCATGCGATG 4 80 

CTGTATGTCG TGGGGATTTT TTTAATCATG CTCTGTGCGC TAACAATCAT CATTTATTTT 540 

AAAGAAAAAG AAGGGGATTT TAAAAGCCAA AAAAATTTGA AACTCCTTAA AGAGCAAGTC 600 

AAAGGCAGTC TTAAAGAGCT TAAAGATAAC CCCAAGCTTA AAATTTTGTT AGTGGGGCAT' 660 

TTGATTACGC CTGTCTTTTT TATGAGCCAT TTCCAAATGT GGCAAGCGTA TTTTTTAAAA 720 

CAAGGCGTTA AAGAGCAATA CCTTTTTGTG TTCTATATCG CTTTTCAAGT GATTTCCATC 780 

CCTCATTCAT TTTTTAAAAG CCAAAAATTA KCAGCCAAAA AATCGCCCTG A 831 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .282 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

TTGTATCCGC CGGGATCTGT GGTTAAAATG GGCGTGGGGT TAAGCTTTTT AGAAAACCTT 60 

CATATCACAG AAAACACCAC TATCCCCACA CCGCCTTTTA TTGAAGTGGG CAAGCGCAAA 120 

TTCAGGGACT GGAAAAAAAC AGGGCATGGC AATTCTAATT TGTATAAAGC CATTAGGGAG 180 

TCCGTGGATG TGTATTTTTA TAAGTTTGGG CTTGAAATCT CTATAGAAAA MCTCTCTAAA 240 

MCCTTTAAGG RAAGTGGRCT TTGGGGAAAA AACGGRCGTT GA 282 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 
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(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION 1. . .183 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

ATGGCACACC ATKAAGAACA ACACGGCGGG CACCACCACC AYCACCACCA CACACACCAC 60 

CACCATTATC ATGGCGGCGA ACACCACCAT CACCACCACA GYTCTCATCA TGAAGAAGGT 120 

TG1 M 1*G C AGC A CTAGCGATAG TCATCATCAA GAAGAAGGTT GrTGYCACGG GYAYCACGAG ISO 

TAA 183 



(2) INFORMATION FOR SEQ ID NO: 36: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .894 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 

TTGGTTAAAA TAAGGTTATT TGATTTTACT ATAAGGTTGT TTAAACCTGA ATTTCACAT7 60 

TTTGATTTTT TAAAAGGGAT TAGAGTTCTT ATGATTGAAT GGATGCAAAA TCATAGAAAG 120 

TATTTAGTGG TTACGATATG GATAAGCACG ATCGCTTTTA TTGCCGCCGG AATGATAGGT 180 

TGGGGGCAAT ACAGCTTTTC TTTAGATAGC GATAGCGCTG CCAAAGTGGG ACAGATTAAG 240 

ATTTCTCAAG AAGAATTAGC CCAAGAATAC CGCCGCCTTA AAGACGCCTA TGCTGAGTCT 300 

ATCCCTGATT TTAAAGAACT CACCGAAGAT CAAATCAAAG CCATGCATTT AGAAAAAAGC 360 

GCGCTAGATT CGCTCATCAA TCAAGCTTTA TTGAGGAATT TCGCTTTAGA TTTAGGGCTT 420 

GGTGCTACCA AGCAAGAAGT GGCCAAAGAG ATCAGAAAAA CGAACGTTTT TCAAAAAGAT 480 

GGCGTTTTTG ATGAAGAATT GTATAAAAAT ATCTTAAAAC AAAGCCATTA CCGCCCCAAG 540 

CATTTTGAAG AAAGCGTTGA AAGGCTTTTA ATCCTTCAAA AAATCAGCGC TCTATTCCCC 600 

AAAACCACCA CCCCTTTGGA GCAATCCAGT CTATCGCTTT GGGCAAAATT GCAAGACAAA 660 

TTAGACATTC TTATCCTAAA TCCTAATGAT GTTAAAATCT CTCTCAATGA AGAAGAGATG 720 

AAAAAATATT ATGAAAACCA TAGAAAGGAT TTTAAAAAGC CCACAAGCTT TAAAACACGC 780 

TCTTTATATT TTGACGCTAG TTTAGAAAAA ACTGATTTGA AAGAGTTGGA GGAATACTAC 840 

CATAAAAACA AGGTGTCTTA TTTGGACAAM AGMGGGGAAA TTACAGGATT TTAA 894 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 
<B) LOCATION 1. . .273 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

ATGGTTAAAC ACTATCTTTT CATGGCGGTT TCGCAGGTCT TTTTCTCCTT CTTTTTAGTG 60 

CTGTTTTTTA TCTCTTCCAT TGTGTTATTA ATCAGTATTG CAAGCGTAAC GCTCGTGATT 120 

AAAGTGAGCT TTTTGGATCT GGTGCAACTC TTTTTGTATT CCTTGCCWGG AACCATTTTT 180 

TTTATTTTGC CGATYACTTT TTTTGCGGCT TKGCGYTTGG GGSTTTCAAG GCTTAGCTAT 240 

GACCATGAAT TGTTAGTGTT TTTTLTCrrT TAG 273 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .261 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 

ATGAGTAAAA GAGCGATCCG TTTCCCTAAC AAGCTTTTTT CATACCCTAA ACCCAAAATA 6 0 

AAAGCGACAA ACACAAGCCA CACCGTTTTA TTCGCATACC CGCTCAAACC CCACGAAATG 120 

GCCTTATTAG CGCTCGCTAC CTCACTGCTC GCTCCAATTT TTAACGCTAT ACACAGCACT 180 

AACGCGCTCA ACGCTATCAA ACCTGATGGC ACCGGCTCTA AAATTAACCC TATAATCATG 240 

CCCATGAAAA TACAAAAATA A 261 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .426 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 

GTGTATAGCC GTTTTTTTGC CAACCAGCAT GAATTTGACT TTGAAGCTCA AGGGGCGCTA 60 

GGGAGCGATC AATCAAGCTT GAATTTCAAA AGTACTCTAT TACAAGATTT GAATCAAAGC 12 0 

TATAATTACT TAGCCTATAG CGCCACAGCA AGAGCGAGTT ATGGTTATGA CTTCGCGTTT 18 0 

TTTAGGAACG CTTTAGTGTT AAAACCAAGC GTGGGCGTGA GCTATAACCA TTTAGGTTCA 240 

ACCAACTTTA AAAGCAATAG CCAATCACAA GTGGCTTTAA AAAATGGCGC GAGCAGTCAG 3 00 

CATTTATTCA ACGCTAACGC AACGTGGAAG CGCGTTATTA TTATGGGGAC ACTTCATACT 3 60 

TTTATTTGCA TGTGGGAGTT TTACAAGAGT TCGCTCACTT TGGATCGAAT GATGTGGCGT 420 

CTTTAA 42 6 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .558 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40 

ATGYATGAAA ATGGTAGGGG TGTACCTAAA GATTACAAGA AAGCGGTTGA ATATTTCCAA 
AAAGCTGTTG ATAACGATAT ACCTAGAGGG TATAACAATT TGGGCGTGAT GTATAAAGAG 
GGTAAGGGAG TTCC TAAAGA TGAAAAGAAA GCGGTGGAAT ATTTTAGAAT AGCTACAGAG 
AAAGGTTATA CTAACGCTTA TATCAACTTA GGCATCATGT ATATGGAGGG CAGGGGAGTT 
CCAAGTAACT ATGCGAAAGC GACAGAATGT TTTAGAAAAG CGATGCATAA GGGCAATGTG 
RAAGCTTATA TTCTCCTAGG GGATATTTAT TATAGCGGAA TGATCAATTG GGTATTGAGC 
CGGACAAAGA TAAGGCTGGT CCATTATAAA ATGGCGGCCG ATGTRAGTTC TTCYAGAGCY 
TATRAAGGGT TGTCAGAGTC YTATCSGTAT GGGYTAGGCG TGGAAAAAGA KWAAAAAAAG 
GCYGAAGAAT ACATGCAAAA AGCATGCGAT TTTGACATTG ATAAAAATTG TAAGAAAAAG 
AACACTTCAA GCCGATAA 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .420 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
TTGCTCAACA TGTGGGATGA AGCCAAGAAA GAAGGGATCA ACATCAATAC AGAAAAGCTC 60 
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TCTCAAGAAT TGGGGGTTGT GTGCGTGCCA ACAAGCGCGA GATYCAAAGA AGATCGCTTG 120 

AACACAGAGC TTTTATTAGA CGAAATTGTC AGGCTTTATT CTCAAAACAC TACAAACAAC 180 

GAAAACATCA AAGTCCCATC TCAAAGTTTT AAAGAGTCTT TAAAATACAG CCAGAGCGCC 240 

CAAAGAATCG CTAAATCAGT GATCAGTGAA AACAAACAAA ATGCGAGTTT TGAACACACT 300 

TATAAGATTG ATAAGATTTT TAATGCACCA GCGTTATGGG ATTTTCATTT TTTTDGGGTT 360 

TATGTTTATC ATCTTTTCTT TGAGCTTTTT AATAGGAGGG GGAGTGCRAA AAGCCCTTGA 420 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

( i i i i HYPOTHETICAL : NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc.feature 
{ B) LOCATION 1. . .582 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 

ATGCAAGAAG CGTTGTTGCG TTTTCAAGAG GGCTTTAAGG AGTGGGGTTA TCTTATTTTA 60 

TTTTTGTATT CTTTGGGGGG TGGGTATGTA GGGATTGTCA TCGCTTCCAT TTTGAGCGCT 120 

ACCACGCACG CTTTGGATAT AAAAATAACC ATTCTTGTCG CTTTTTTAGG GAATTTAATA 180 

GGGAGTGGGG CTCTTGTAAT CTTTGCCCGC TATCAAAAAA GAGAGTTTTT AAAGTATTTC 240 

CAAAAGCATA GAAGAAAGCT TGCTTTGGCG AGTTTGTGGG TGAAACGCTA CGCCTTGCTC 3 00 

ATGATTTTTG TCAATAAATA TCTCTATGGG ATTAAAAGCG TTGTGCCTTT GGCAATTGGT 3 60 

TTTAGCAAAT ACCCTTTAAA AAAGTTTTTA TGGCTTAATG TTTTTTCCAG TTTTTTGTGG 420 

GCGTTAATCG TGGGGAGCGT TTCTTTTCAA GCGAGCGATT GGGTGAAAAC GCTGTATGAA 480 

AGGCTTTCTC ATTACACTTC GTTTTTTGTC ATAAGTTTTG TTCTTATAGC GCTTTTAATA 540 

TGGTTTTTAT TGAAACGATA TTCGCGCAAA ATGGGKTTTT AA 582 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(DJ TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .390 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

ATGCGAAAGG GGCGTGTGAT GTTATGCGTG TTTGATATAG AAACCATTCC TAATATAAGC 60 

TTGTGTAAAG AGCATTTTCA ATTAAAAGAA GACGATGCGC TAAAAATCTG TGAATGGAGT 120 

TTTGAAAAGC AAAAAGAAAA AAGCGGGAGC GAGTTTTTGC CCCTTTATTT GCATGAAATC 180 

ATCTCTATTG CAGCMGTCAT TGGCGATGAT TACGGGCAAT TTATCAAAGT AGGGAATTTT 240 

GGTCAAAAAC ACGAGAATAA AGAGGATTTT GCGAGCGAAA AAGAGCTTTT AGAGGACTTT 300 
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TTCAAATACT TTAACGAAAA GCAACCGCGC CTAATAAGCT TTAAWGGCAG GGGTTTTGGA 3 60 

TATTCCCCTA CTCACGCTCA AAGCCCTTAA 3 90 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL; NO 

(iv) ANTI- SENSE; NO 

{ v i i ORIGINAL SOURCE : 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE : 

(A) NAME /KEY : misc_£eature 

(B) LOCATION 1. . .924 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 

ATGGCTAAAA AGAAAATTGC GATCAGCTGT GGGGATATTC AAGGCGTAGG CTTAGAATTG 60 

ATCTTAAAAA GCCATAAGGA AGTGAGTGCA CTTTGTGAGC CGTTGTATCT CGTTCATAGC 120 

GAACTTCTAG AACGAGCCAA TCAATTGCTT GATAACGCTT ATGAAACTAA AACGCTTAAT 180 

GCGATCGCTA TTGATGCCCC TTTACCCTTA TTAAACTCTA GCACGATAGG CAAAGTCAGC 240 

ACTCAAAGCG GGGCGTATAG CTTTGAGAGT TTTAAAAAGG CTTGCGAGTT GGCGGATAGT 300 

AAAGAAGTGG ATGGCATTTG CACTTTGCCT ATCAACAAAC TCGCATGGCA ACAAGCTCAA 360 

ATCCCTTTTG TGGGGCATAC CGATTTTTTG AAACAACGCT ACAAAGATCA TCAAATTATT 420 

ATGATGCTTG GGTGTTCAAA ACTCTTTGTG GGGCTATTTA GCGACCATGT GCCTTTAAGC 480 

GCGGTTTCTC AACTCATTCA AGTGAAAGCG TTAGTTAAGT TTTTATTAGC GTTTCAAAAA 540 

AGCACTCAAG CCAAAATCGT TCAAGTGTGT GGTTTCAACC CCCATGCGGG CGAAGAGGGA 600 

TTGTTTGGGG AAGAAGATGA AAAGATTTTA AAAGCCATTC AAGAGAGCAA CCAAACGCTA 660 

GGTTTTGAAT GCTTTTTGGG GCCACTGCCC GCTGATAGCG CTTTTGCCCC CAATAAACGC 720 

AAAATAACCC CCTTTTATGT GAGCATGAGC CATGATGTAG GGCTAGCCCC TTTAAAAGCG 780 

CTCTATTTTG ATGAAAGCAT CAATGTGAGT TTGAACGCTC CCATTTTACG CGCTTCCACT 840 

GACCACGGCA CGGCGTTTGA TATTGCTTAT CAAAATAAGG CGAACCATAA AAGCTATTTG 900 

AACGCGATCA AATACTTGGC TTAA 924 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc„feacure 

(B) LOCATION 1. . .440 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 

ATGAGCAGCG GGTTAATTTA CATTTCRTTA GAAGTCTTGG TARCGTGTTT GATCACCGCT 60 
CTAATCATGT ATTATGTGAT GAAAAAGATC TATTACGCTA GAGGGCAAGC CATTTTAAAA 12 0 

GGCGCTTCAG CCAAAGCTAA ATTAATGGAA TTTCAAGCGA AATCTTTCGT GGAAGCTGAA 180 
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GAAATGCGCA TGAAAAGCCA AGAATGCAAG TTGCAACAGC 
CAACTCCAAA CCCATTTTGA TAAAAAAGAA GCGCATTTGA 
AAAGAATTTG TAAGAGATGA AAAACGCTAT TTGGAAAAGG 
GAACGCCAAA TTTTAGAAMM AGAGAGGGAA AATTTTRRRR 
RRGSCTCRRG CCAAAGCGCT 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL : NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .384 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 

ATGAATATCA AAATTTTAAA AATATTAGTT GGAGGGTTAT 
CATTTATGGG GGAAACAAGA CAATAGCTTT TTAGGGATTG 
GGGAATTATT CTAAAGCGGC GTCTTATTTT AAAAAAGCAT 
GGCTGCACGC AATTAGGAAT CATTTATGAA AACGGGCAAG 
AAAGCCCTAG AATATTATAA AACCGCATGC CAGGCTGATG 
TTAGGGGGGC TTTATGATGA GGGTTTAGGC ACGGCTCAAA 
GCTTACGCAA GGCATGCGTT TTAA 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .351 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47 

ATGGCGATAG CCATTAAGGA TTTATTGAGC GCTTATAAAG 
ATCAGCATGC CATCTAGCGC GGATTTGAAG CTCACTTTGC 
CCCTTATTTA GCGTTCAAGG CAGCGTTAAT TTGCAAGAAG 
ATCCCCCTTT ACACGCAAAG CGCTCAAATC AATTTGGACA 
ATCTACATAG ACACGATCCA CACGCGCTAT GCAAACATGC 
GCTTTAGATT TAGGTCAAAA AAACCTYTCY YKGGAKKCYY 



AATATGAAAA TAAGAATTTG 
AGCATTTAGA AGCGCAGCAC 
AAAAAAAAGA GCTTGAAAAA 
RSCAGCGCGC CTTTGTRRRG 



240 
300 
360 
420 
440 



TTTTTTTGAG 
GTGAAAGAGC 
GCAACGATGG 
GCACTAGAAT 
ATAGGGAAGG 
ATTATCAAGA 



CTTGAACGCC 
CTATAAAAGC 
GGTGAGTGAA 
AGATTATAAA 
GTGTTTTGGC 
AGCCMTTGAC 



60 
120 
180 
240 
300 
360 
384 



TCGTTTTACC 
AATTCTTAAA 
GCACTTTCTC 
TCGCCCAAGA 
KGGATTTAGA 
TAGGKCCATA 



TTTGGATAAA 
AAACACCGCC 
GCTCTATAAT 
ATACCAATAC 
CGCTAAAATC 
A 



60 
120 
180 
240 
300 
351 



(2) INFORMATION FOR SEQ ID NO: 48 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 9 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 
{ B } LGC AT I ON I . . .245 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 

ATGCCGGATA AYTTGCATTT GCACACCCTT TTATYTAAAT TCTTGCAACA ACGCTCTTTC 60 

AATTACCCTA ACCCTTTATG CGCGTTTATC CTTATTCTAT GCAACCTGCC TTTTATTTTA 120 

ATAAGCGTTT TGTTTCGCTT GGACGCTTAT GCGCTCATTG TGATTAGCCT AGTCTTTATC 180 

RCATGCTATT TAATAGGCTM TGCTTATTTG AATAGGCAAG TTTGCGCTTT AGAAAAGCGG 24 0 

GCGTTTTAA 249 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .351 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 

TTGAAAGTAA CRAATCCGCA TTTGTTGGTG GTAATCCAAG ATTTAAACGC TCGCATCGCT 60 

TTAATGAAAC TCTTATTCCA AAACGTTAAG AGCGCGAACA AAGAATTGGT TTTTTGCAAT 120 

AAAGAAAAAC GCTTGATAAG GTCTTTTGAT GCACAAAAAG AATACGGCAT CACGCCTGTA 180 

GAAAATATTT TAAGCGTTTT AGACACCGCT ATGAATCCTA ACAGCGCGCT TGTGATAGAC 2 40 

AATCTCAACG AAGCGAAAGA ATTGCACGAC AAAGTAGGGG CGGAAAAGTT AAAATCGTTT 3 00 

TTAGAAAAAG CCCMTAGACA ACGAGCAGTA TTGCGTCATT TTTGCGCATG A 3 51 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 597 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . -597 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 

ATGAAAGAAT CTTTTTACAT AGAGGGAATG ACTTGCACGG CGTGTTCTAG 

CGCTCTTTAG GACGTAAAAG TTTTGTGAAA AAAATAGAAG TGAGCCTTTT 

GCTAACATTG AATTTAACGA AAATGAAACC AATTTAGACG AAATTTTTAA 

AAACTGGGTT ATAGCCCTAA AAAAACTCTA GCAGAAGAAA AAAAAGAATT 

AATGTTAAAT TAGCGTTGGC GGTTATTTTC ACGCTTTTTG TGGTGTATCT 

GCGATGCTTA GCCCTAGCCT TTTACCTGAA AGCTTGCTTA CGATTAACCA 

TTTTTAAACG CTTGCTTACA GCTTATAGGC GCGCTCATTG TCATGCAT7T 

TTTTACATTC AAGGGTTTAA AGCCTTATGG CACAGACAAC CCAACATGAG 

GCCATAGGCA CAAGTGCTGC CTTAATTTCA GCCTGTGGCA ATTGTATTTG 

ATCATTATAC CGATCAGTGG TCTTATGGGC ATTATTATTT TGAAAGCGTG 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .258 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 

GTGGGGATTG TGCCGGATAA TTTGTGGAAG CTCAAACGCT TCAATCAAGA CTGGCGCGTT 
GGGGACACGC TCATTACTGC TATTGGGCAA GGCTCTTTTT TAGCCACGCC TTTGCAGGTG 
TTAGCCTACA CAGGACTCAT TGCGACAGGC AAACTGGCAA CGCCTCATTT TGCTATCCAT 
AACCAACAAC CGCTCAAAGA CCCCCTGAAT AGGTTTTCAA AAAAAGAAGC TCCAAGCCTT 
GCGCGTGGGC ATGTATGA 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 



CGGGATTGAA bU 

AAATAAGAGC 120 

ACTCATTGAA 180 

TTTTAGCCCT 240 

TTCTATGGGG 300 

TCATAGTAAT 360 

GGGGAGGGAT 420 

CAGCCTTATC 480 

GTTTATACCA 540 

TGCGTGA 597 
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fA) NAME /KEY : misc_f eature 
(B) LOCATION I . . . 1032 

{xi} SEQUENCE DESCRIPTION : SEQ ID NO : 52 

ATGCAAAACT TATTGATACA AGCAGAAAAT GCAATCGCTC TACTTTTTTT GTTAAATGAC 60 

AAAAACCTAA AAGGAAAAAT AGATTTGATA TATATTGACC CTCCATTTGC TACAAACAAT 120 

CATTTTACTA TCACAAATGG TAGAGCAACC ACAATTAGCA ATTCTAAGAA TGGCGATATT 180 

GCTTATAGTG ATAAAGTAGT GGGTATGGAT TTTATGGAAT TTTTAAAACA ACGCCTGGTA 24 0 

TTGCTTAAAG AATTGCTTTC AGAACAAGGC TCTATCTATG TGCATACAGA TTACAAGATA 3 00 

GGGCATTATG TCAAGGTAAT GTTAGATGAA ATATTTGGCA TACAAAATTT TAGAAATGAA 3 60 

ATCACACGCA TAAAGTGCAA TCCTAAAAAT TTTAAAAGGA TAGGCTATGG TAACATAAAA 420 

GATATGATTT TATTTTACTC TAAAGGAAAA AATCCCATTT TTAACGAACC TAAGATCCCT 480 

TATACGCCAC AAGATTTAGA AAAACGATTC CCTAAAATTG ACAAAGATAA AAGGCGTTAC 540 

ACTACCGTTC CAATACATGC TCCAGGAGAA GTGGAAAGTG GCGAATGTTC TAAAGCATTT 600 

AAAGGTATGC TACCTCCAAA AGGGCGGCAT TGGCGCACTG ATATTGCCAC ACTTGAGCGT 660 

TGGGATAAAG AAGGTTTGAT TGAGTATTCT AACAATAATA ACCCTAGAAA AAAAATTTAT 720 

GCCTTAGAAC AAGTTGGCAA AAGAGTCCAA GACATCTGGC AATTTAAACA CCCACAATAT 780 

CCAAGCTACC CTACAGAAAA AAACGCTCAA TTATTAGACT TAATCATTAA AACCTCTTCT 840 

AATAAAGATA GTATTGTTTT AGATTGTTTT TGTGGTTCTG GAACAACCTT AAAATCTGCG 900 

TTTTTATTGC AACGAAAATT TATAGGCATT GATAATTCCG ATTTGGCTAT CCAAGCTTGC 960 

AAAAACAAGC TTGAAACAAT AACAAAAGAC TTGTTTGTTT CTCAAAATTT TTATGATTTC 1020 

CTTGTTTTTT AA 1032 

(2) INFORMATION FOR SEQ ID NO: 53: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: circular 

< ii ) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 
( B} LOCATION 1. . .531 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 

ATGACAAGCG TTGTCATCAA GCCCCATGCC TATGGCGAGC AAGTCCAAGA AATAGAAGAA 60 

GAGTCAGATA GCGATTATGA AAAGAATAAC GACCAGGAAG CGATCAATTT TGGTATCGCC 120 

TTGCATAAGG GATTGGAATA CCAATACGCT TACAACATTC CTAAACAAAG CGTTTTAGAA 180 

TATTTAAACT ACCACTATGG TTTTTATGGT TTGGATTACC AAGCGTTAGA AGAAAGTTTA 240 

GAGCTTTTTG AAAACGATGC AGGGATACAA GCCCTTTTTA AAAATCATGC CTTAAAGGGT 300 

GAAGCGGCTT TTTTATTCCA AGGGGTTGTG TCTAGGATTG ATGTTTTATT GTGGGATAGA 360 

GGGCAAAATT TGTATGTTTT AGATTATAAA AGCTCTCAAA ATTACCAGCA AAGCCATAAA 420 

GCGCAAGTGT CTCATTACGC TGAGTTTTTG CGAACTCAAG SCCCCCATTT TAAGATACAA 480 

GCGGGCATTA TTTACGCTCA TAAAAGACTG CTTGAAAAAT YATGGGKTTG A 531 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .7 83 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 

fiTTTPTGAGG ATTTGCCTTT TGCGAGCGAT TCGCAATTCA CTTACAATGG GGTGAGCATC 
££S5ccCA CTAATGAGGT CAATGATGTG ATCAGCGGGG TTAATATCAC TTTAGAGCAA 
trrAPAGAG? CTAATAAGCC TGCCATTATC AGCGTGAGCA GAGACAATCA AGCCATTATA 
cacagccSa S^TOTOT CAAAGCCTAT AATGAGCTTA TCCCTAAACT AGACGAAGAC 
^?SStg ^Sctgacac TAAAATCGCC GGGATTTTTA ACGGCGTGGG CGATATTCGT 
re?S?5E£ TAATGTGTTT TCTTATAGCG tgcatacgga TAATGGGGTA 

££S5? ££^3 ESSE 

JScgmSa AGGATATGGG GGGCAGAGAA ATCCACCAAG AGGGCATTTT ttctaaattc 

ScSgtca tcgctaacct catagatgga gggaacgcta aattaaagat ttatgaagat 

TCCCTAGACA GAGACGCTAA AAGCCTGACC AAAGACAAAG AAAACGCTCA AGAGCTTTTA 
AAAACCCGCT ATAACATCAT GGCGGAGCGC TTTCCCGCTT ATGATAGTCA AATCTCTAAA 
^tcSI Egtgcaaatg ATGATCGATC AAGCAGCGGC TAAAAAGAAT 

TAA 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
( iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

{ix} FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .438 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 

ATGCGCATCG TATTTATGGG AACGCCTAGT TTTGCTGAAG TGATCTTAAG GG CGTTGGTT 60 
tSaatgaag ATAAAAAGAT AGAAGTGGTG GGGCTATTCA CTCAAAGGGA CAAACCTTTT 120 

^SSl SSXSSZZ Jg^ccagag actaaaacat acattttaga aaatcattta 180 

AATATTCCCA TTTTCCAGCC GCAAAGTTTG AAAGAGCCTG AAGTTCAAAT "TAAAAGGT 240 
TTGAAGCCTG ATTTTATCGT GGTGGTGGCT TATCGTAAGA TTTTGCCTAA AGAGGTTTTA 300 
ACCATCGCTC CTTGCATTAA TTTGCATGCG TCGTTATTGC CCAAATACAG GGGGGCTTCG J&u 

SaJtStg agatgatact caatgacgat aggatttatg gcataagcac catgcttatg 420 

GAKTTTGGAA TTGGATAG 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 



SUBSTITUTE SHEET (RULE 26) 



WO 97/19098 



PCI7US96/18542 



102 



(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .747 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 



ATGCGTTTTT 
GACATTAAAG 
AAAAACAGCT 
TTTCAAAAAG 
CGTTTGAATA 
ACTTCATTCA 
GAAGAAAACA 
GAAAAAGGGC 
ATAGGCCAAA 
GATTTCATTG 
GATGGGATTT 
ACTTATAAAA 
GTGTTTTTAA 



ATTTTAAATT 
GCAGCTCTTC 
TACTAGACAA 
AACGCTTGAT 
GTATTTATCC 
TCAA'i'TTAGA 
AGATTTATGG 
GCTTGAACGG 
ATCAAGTCTT 
TCCCAAGCGC 
TTGGAGCGGG 
GCGCGGTGTT 
GCGGTGTGAA 



CCTTTGGCTT 
TTATATCAGC 
CGTTCAAGCG 
TTTAGAAGCT 
TTTAGAAAAT 
AGACACACAC 
CCTTGTCTCT 
GTTTTTGAAC 
AGGCTTTATA 
TGAAATCAAC 
GGTGTTTGTG 
GAAAAACGCT 
AAACTAG 



TTAGGGATTT 
GACCGGGTTA 
TATTTTTTTC 
TTAAAACTAG 
CCAAAAATGA 
AGCGTTTCTC 
CACAACCAAG 
GCCCACAAGC 
GGGACTAATT 
ATAGGCGATC 
GGTGAAGTTT 
TTTTTAAGCG 



TTCTTATTTT 
AAAGCGCCTT 
AAGCCCAAAA 
AAAACGCTGA 
CTTATACCCC 
TCAACCCTAT 
CCATAGGCAT 
GGTGTGCTTA 
TCAAGCAAGA 
AAGTGCTAAC 
CAAGCGTTGA 
AAGCCAAACT 



TTATTTTTTA 
GATGAGCGCG 
CATTAAGGAA 
TTTGAAAGAG 
TACTTTCATG 
TGTGAATTTA 
TGCCGTGCTA 
TAGCGTGATG 
ATTAGTCGTG 
GAGCGGGCTA 
AGATCATTAC 
TTTAAGGCAT 



60 
120 
180 
240 
300 
5 60 
420 
480 
540 
600 
660 
720 
747 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY: raisc.f eature 

(B) LOCATION 1. . .360 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57 

TTGGCTTTAA GATTGCCTTT TTTGATCGCT CACGTCATCA ACATGTTTTT ATTCTACCTC 60 

ATAGGGCGAA AGATTTTAAA AAAGCCTAAA GACGCTCTTT ATGTGGTATT GACTTACGCT 120 

TTATTGCCTG GGGTGAATCT CTTTGCGATT TTACTGGCTA AAAGCGTGCT GGTGTTAAGC 180 

CTTGGGCTTT TGATTAGCTA TTTGTATATT AAAACCCAAA AAATCCCTTA TTTAACCCTT 240 

AGCGCTTGCG CGTTTTTAGA CGGTGCGTTC ATCCCGCTTT TACTAGGGGT TTTTGCCTAC 300 

GCTTTAAGGA AAACGGCTAT TTTAAGAGCG CGATCTTTGC TTTGGTGGTT TTAMTTGTGA 3 60 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .327 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 
GTGAATTTAA TGGACTATTT TTCTAAAAGT jrGTTTTTAA ATTCATTGAA ^GCAGCGA 60 
TTGATCGTCT CCAACAAATT AGCGATTGAC GTGCAATACG GCATGCTCCA 180 
AAAAATTACC CTGATGTGGT GGATGGGGGT GTTAGGGAGG GGCCTTTTTG f™?^ 24Q 
S5S ASS S i-OCMTQC TGATGCCATT 300 

GATAGTTTCT TCAGCAAGAA TGATTAG 
(2) INFORMATION t*OK SEQ ll> NO: 59: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

<iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .474 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 

TTGGCGTCTC GCTATTCTGT GGCTGTTGGG ^ATTTT CJGMCMTT ^TAT^TTTA 60 

AGAAATGAAA CCATGACCAA TCTCATTGGT ™ACTGG TGTTGGCGTC CAJTTC J 18Q 

TTTTTTTTAG CTCTTGGAGT GTTGCTAGGC AAGATGTTAG ^i^JJ^rZ rrTTTC TTTC 240 

ISJIagg cgttagggtt tattttttca tgtttgaaga cttttttagt ^ttctttc 

ATCCTTTATG CGCTCTCTAA AATGGATTTA ATGAAAGACG CTAACGCCTA 360 
{ESScTA TTTTCCCCAC CATRAAAARC RTCRCTAGTA ^CATGCG gTTGATGGC 3 
GTCAAACATG TGGAGAAAAA CCTTAAAGAC ^^GAAG AAA^GAGCGA TGAAGTTAAA 
AATAAAGGAT CTATTGATAA CGCCAAAGAA TCTTTTAATA AGGGCTACGG ATAA 

(2) INFORMATION FOR SEQ ID NO: 60: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 6 base paxrs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1 . . .246 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 

TTGAGCAAGC AAAGCGCGGA CATTGTGATC ACTAATGACT CCTTAAGCTC TTTAGTCAAA 
GTTTTAGCGA TCGCTAAAAA AACTAAAAGC ATTACTTGGC AAAATATCTT GTTCGCTTTG 
GGGATTAAGG CGGTTTTTAT CGTGCTAGGG CTTATGGGGG TAGCGAGCTT GTGGGAAGCG 
GTCTTTGGCG ATGTGGGGGT TACGCTTTTA GCYTTAGCCA ATTCCATKCG CACGATGAGG 
GCTTAA 

(2) INFORMATION FOR SEQ ID NO : 61 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(iii I-JOLECULE TYPE. DNA {genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1 . . .240 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61 

ATGAAAAATT TAAGGCATTT TAGAAAGCTT ATCGCCTTTT TAGGTTTTTC ACCTCTTTTA 

TTACAAGCGG ATATGACTAC CTTTTTTAAT TCCATTGAAC AACAGCTCAC TAGCCCTACG 

GCTAAAGGCA TTTTAATGGT TATTTTTTTA GGACTTGCTA TTTTTATATG GAAAAACTTA 

GATAGATGGA AAGAAATTTT AATGACCGTG CTTGCTTTAA AAGRAGTCCC CATGCAATMW 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .978 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

TTGGCGGGTT TGCYAGTGGG GTGTATRCGG ATGAAACAAA CATTTTGGGR ACTTAGTTGG 60 

GGGGAAAAAA GCCAAAAGGT ATGCGTGCAT CGTCCATGGT ATGCTATATG GAGTTGCGAT 120 

AAATGGGAGG AAAAAACACA ACAATTTACA GGAAACCAAC TCATCACAAA AACTTGGGCA 180 

GGGGGTAATG CGGCTAACTA CTACCACTCT CAAAACAACC AAGACATCAC AGCCAATTTA 240 

AAAAATGATA ACGGCACTTA TTTTTTAAGC GGTCTGTATA ACTACACCGG AGGGGAATAT 300 

AATGGGGGGA ATTTAGACAT TGAATTAGGC AGTAACGCTA CTTTTAATCT AGGTGCGAGT 360 

AGTGGGAATA GCTTCACTTC TTGGTATCCT AATGGGCATA CTGATGTTAC TTTTAGCGCT 420 

GGGACTATCA ATGTGAATAA CAGCGTAGAA GTGGGCAATC GTGTGGGATC GGGAGCTGGC 480 

ACGCACACCG GCACAGCCAC TTTAAACTTG AACGCTAATA AGGTTACTAT CAATTCCAAT 540 



60 
120 
180 
240 
246 
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aTrarrrc-T ATAAAACT-C GCAAGTGAAT GTAGGCAATG CTAACAGCGT TATTACCATT 
SSESSS c?™tc§ GGAAACTTGC AGKTCTTTAG CTAGGGTGGG cgtaggggct 
AATTGCTCCA CTTCTGGGCC TAGCTATTCT TTTAAAGGGA CSACTAACGC TACTAACACS 
SSStAGCA AWTCAAGCGG SAGTTTCACT TTTGAAGARA ACGCCACTTT TAGCGGGGCG 
^ItIIaatp rrGGGGCA-T CACTTTCAAT AAAAAGTTTA ACGCTACCAA TAATACCGCT 
S£S££g S??"^ SSSSSk A^AAGCTCTT TTAATGGTGC GAATTTTAGT 900 
HcSScI? aScTTTTAA TAATCAAGCC ACTTTCCAAA ACAGCTCCTT TAATGGGGGG 960 
ACTTTTACTT TTAATTGA 



600 
660 
720 
780 
840 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 

{iii MOLECULE TYPE. DNA (genomic) 



(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME / KEY : misc.feature 

(B) LOCATION 1. . .816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 

TTGTTAAGTT TAGTTAAAGG GAAAACCATG CTCCGCTCTC TCTATAGTGC CACTTCAGGG 60 

ATTCTCGCCC AACAAACGCA CATTGACACC ACTTCAAACA ACATCGCCAA TGTCAATACC 120 

iSSGTTTA AAAAATCTCG CGCGGATTTT AACGACTTGT TTTACCAAGC GATGCAATAC 180 

GPCGGCACCA ACACAAGCAA CACGACTTTA TCGCCAGATG GCATGGAAGT GGGCCTTGGC 240 

SacgcccS gtccSattac CA^AATGTTT TCGCAAGGCA GCCCTAAAGA AACGGAGAAT 3 00 

SJSSSS TTGCTATTAC AGGTAAAGGC TTTTTTCAAG TCCAGCTTCC TGATGGCACT 360 

^CCGCTTACA CAAGGAGCGG GAATTTCAAG CTAGACGAGC AGGGCAATCT TGTAACAAGC 420 

GAGGGCTATC TCCTCATCCC TCAAATCACT TTACCCGAAG ACACCACGCA AGTGAATATC 480 

ggSgga?g SacggSg CGTOACTCAA GGCTTGCAAA cgacttctaa CGTGATCGGG 40 

CAAATCACT^ TGGCTAATTT TGTCAATCCG GCGGGGCTTC ATTCTATGGG GGATAATTTG 600 

SJJtccatca CCAACGCTAG CGGCGATGCG ATTGTGGGCA ACCCGGATTC TCAAGGCTTA 660 

gSSg?TAA §GCAAGGCTT TTTGGAGCTT AGTAACGTGA GATTGGTAGA AGAAATGACA 720 

GATCTAATCA CCGCTCAAAG GGCTTATGAA GCCAATTCTA AAAGCATTCA AACCGCTGAT 7 80 
GCCATGCTCC AAACAGTCAA TTCCCTCAAA CGCTAA 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1. . .273 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 64 
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ATGCAAAATG GGTATTATGC GGCCACAGGG GCAATGGCTA CACAATTTAA CCGCTTGGAT 60 

TTAACCTCTA ACAATTTAGC CAACCTAAAC ACCAACGGCT TTAAAAGAGA CGATGCGATT 120 

ACAGGCGATT TTTTAAGGCT TTACCAAGAA TACCGAGAGC AACTGCCCTT AGAAGATCAA 180 

ACCAAAGCGA GCGCGAAGTA TCTCAACCGC AMCCTCAATC GTGTGCCTAT TCTATCARAA 240 

ATCTATACKK ATAGGRAGCT TGGCYGCGTT TGA 273 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

<iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION 1. . .585 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 

GTGGGGGCTA TGCCTACTAT CCAAATCCGT GRCTTTGGAG CGGGGGGTTC AGGGCATAGC 
GATGCGACGC TCATGTTAGT TAATGGTATT CCTGTTTATA TGGCCCCTTA CGCTCACATT 
GAGCTAGACA TTTTCCCTGT TACCTTTCAA GCCATTGATC GCATTGATGT GATCAAAGGT 
GGAGGCAGCG TGCAATATGG GCCTAACACT TATGGGGGTA TTGTCAATAT CATCACTAAA 
CCTATCCCTA ATCAATGGGA AAACCAAGCG GCTGAAAGGA YCACTTATTG GGCTAAGGCT 
AGAAACGCTG GGTTTGCCGC TCCCCYTGAT AAAACCGGCG ATCCTTCTTT CATCAAGTCT 
TTAGGCAACA ACCTCCTCTA TAACACTTAT GTGAGGAGCG GAGGGATGAT CAATAAGCAT 
GTGGGTATCC AGCGCAAGCT AACTGGGTTA GAGGCCAAGG CTTTAGGGAC AATAGCCCCT 
CTAGTATTTC AAACTATTGG CTGGATGGGG TCTATGACAT CAATGAAAGC AATGGGATTA 
AAGCCTATTA CCAATACTAC GATTTTGGCT ATCGSCCAAC CGGGA 

(2) INFORMATION FOR SEQ ID NO: 66: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: double 
(DJ TOPOLOGY: circular 

<ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .255 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 

ATGAGAWAGG AGAAAATAAT GACGAATTTT GAAAAGRTTA TCGCGCAAAA CAGGCTCAAA 
ACGAACGCGG TTTTAACCAC TTACTGCGCG ATTTTTGCTT TTATTGGGTT GTTGGTGGAT 
GCTATTAGAA TCAACGCTAA TGATTTAGGT ATAGCCCTTT TTAAACTCAT GACTTTTCAA 
ATTTTTCCTA CGRTTACTAT TGTCATGTTT GTGGTGGCTT TTGTCATTAK KCKKAGTTTG 
TATCCAAAAT TTTAG 



60 
120 
180 
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360 
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(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : circular 

{ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .231 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 

ATGGRCATGT CTCATATTAT TAAGAGCATT GAAGCTTTAG 
ACGCTTAATG GGCCAGAAGC CCCGTTTTTA GCGAATTTGG 
TTGAGTAAGG ATTACGCTGA TTACTTGGCT CAAAATAATA 
AAMCCTGTTG GGACAGGGCC TTTCAAATTC TTTTTGTGGA 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 591 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .591 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 

TTGATGAGGA AAATTTTTTC TTATATTTCT AAGGTTCTAT 
GCAGAGCCTG ATTCTAAAGT GGAAGCCTTA GAAGGGAGGA 
AAAAAAATCC GCCAAGAATT GAAGAGTAAG GAATTGAAGA 
GATTTGAAAA ATAAAGAAGA AAAGAAAGAA ACAAAAGCCA 
GTCCATCATG GGGACGCCAA AAATCCCACT CCAAAGATCA 
AGTAGTAAGG GCGTTCAAAA TCAAGGCGTT CAAAACAACG 
GATACAACCC CTCAAGCTAC TGAAAAAAAT AAGGAAACAA 
TCCATTTTTG GTAATCCTAA TAACGCTACC AACAACACCC 
GGCATTTCAT TGCTTGTTAA TGGTTCGCCT ATCACGCTGT 
GAAAAATCTA AAGTGAGYAA SGCTCRAGCT AGGGATCGTT 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



ATGACTATAC CATTAGATTC 
GCATGGACTT TTTAAGCATT 
AAAAAGACGA GTTGGCTAAA 
ATAAAAGATG A 



60 
120 
180 
231 



TATTTATTGG 
AGCAAGAGTC 
ATAAGGAATT 
AGAGAAAACC 
CGCCTCCTAA 
CGCCAAAACC 
GCCCTAGCTC 
TTGAAGATAA 
ATCAAATCCA 
TGAKTCKCTG 



GGTGGTTTAT 
TTCTTTGGAT 
AAAGAACAAG 
CAGAGCAGAA 
AATCAAAGGG 
TGAAGAAAAA 
TCAATTCAAT 
GGTCGTAGGG 
AGAAGAGCAA 
A 
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(D) TOPOLOGY: circular 
<ii) MOLECULE TYPE: DNA (genomic) 
(iii> HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc__f eature 
<B) LOCATION 1. . .54 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 

ATGAAGAGAT CTTCTGT^f" 71 r rti,i^ f T ir r r nr it v T ^Q 
CATAAAATGG ATAATAAGAC TGTGGCTGGC 
CCTGTTACTA CAGAACCAGC TCCAGAGAAA 
GTTGAAGAAA AGCCGGCTAT TGAAAGCGGG 
GACAAGTATG AGATCAAAGA ATCCGATCAA 
AAAGAAAACC ACATGCAAGT GCTTTTGGAA 
TACAACCAAG CGCTTGGCGT TAAAAGGACT 
GGGGTAGAAA AAGATATGAT CAAAACCATC 
CAAAAAACTA GAGAATGTTA CAGAGAAAAC 

(2) INFORMATION FOR SEQ ID NO: 70 



G T AGC TTTT T TATTG^T A^t 1 
GATGTGAGCA CTAAAGCGGT 
GAAGAGCCTA AACAAGAGCC 
ACTATCATCG CTTCTATTTA 
GAGACTTTAG ATGAGATCGT 
GGCAATACCG ATGAATTTGG 
TTGAGCGTGA AAAACGCTTT 
AGTTTTGGCG AAAGCAAACC 
AGAAGAGTGG ATGTCAAATT 



TGGCTGTAGT 6n 

TCAGACTGCG 120 

AGCTCCAGTG 180 

TTTTGATTTT 240 

GCAAAAAGCT 300 

CTCTAGCGAA 360 

AGTCATTAAA 420 

CAAATGCGTC 480 

AGTGAAGTAA 540 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .861 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 

ATGGGAACGC TCATTGAAAA ATGGTTTGGC TTCTCTCAAA TCAGAGAAGA ATTAGAAGCT 60 

CGCATCAGTG AGTTAGAAGA CGAAAACACC GAATTGTTAA GAGAAAGAGA ATACTTAGCT 120 

GCAGAAACTA GCGAGTTAAA AGACGCTAAC GATCAATTAC GGCAAAAAAA CGACAAGTTA 180 

TTCATAACAA AAGACAAGCT AACCAAAGAA AACACCGAGT TATTCGCAGA AAACGAAAGC 240 

TTATCTGTAA AAATCAGCGG GTTAGAACAC TCTAACGATC AATTATGGCA AAACAATAAC 300 

AAGCTCACTA AAGAAAAAGC AGAACTGAAA ACGGAAAAAG ACATTTTAGC TAAAGAAAAC 3 60 

ACACGCTTAT TAGCAGCCAG AGATCGGCTG ACTGAAGAAA AAAGAGAATT GACAACAGAA 420 

AAAGAAAGGC TAAAAAGAGA AAACACCGAG CTAACCCATA AAATCACCGA GCTGACTAAA 480 

GAAAATAAAG CACTAACCAC CGAAAACGAC AAGCTCAACC ACCAAGTTAC CGCGCTCACT 540 

AATGAGCGAG ATAGTCTCGA ACAAGAGCGA GCGCGATTGC AAGATGCGCA TGGGTTTCTA 600 

GAAAAACGAT GCACCAATTT AGAGAAAGAA AACCAACGCC TAACTGACAA GCTCAAACAA 660 

TTAGAAAGCG CTCAAAAAAG CTTGGAAAAC ACTAACAATC AATTACGGCA AGCTTTAGAA 720 

AACTCTAATG TCCAATTAGC ACAAGCTAAA GARARAATWG CCATAGAGRA AAGCGAGCTG 7 80 

GMGCGAAGAA ATCGCACGCT TGAAGAGCTT AGAGGGTATG GAAGCCAAAA GSCGATCTGG 840 

ACTTACACAW CAGGCGTTTA G 861 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 33 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .333 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 

PTTTTrrCCA AGCTTTTGGG TAAAAATTGC ATAGAAACGC ATAAGGGGGT GGGCTATCGC 
?5£?SS atoIIaaaaa ATCCCTCAAA CTCTTTTTAG GGACTTATTT aggctcttcg 
TTTGTGTTAA TGCTAGTGAT TAGCGTTTTA GCGTTTAACT ATGAAAAAAA CGAAAAAATC 
IIIrtgatac GCATGGACAT GGACAAAATG GCTTCTAAGA TCGCTAGTGA aattatccaa 
toSJSSS AAACGCATGC GGATTATCAC AACGCTTTAA ACGCCCTGAT TTCACGTTAT 
AAAGACGTTT CCATAGYCCT YTYYGAYACG TAA 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1 . . .375 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 

TTGATGACCA AAAGCTTAAA ACTCATTCAA AAAGGGGTTA AAAA^CTCTA ^AAACCCTT 
AAAAATAGGG CTTTAGAGCA TCAAGACACG CTAATGGTGG GCAGAAGCCA TGGGGTGTTT 
^Saaccca TCACCTTTGG TTTAGTTTTA GCTCTTTTTG CTGATGAAAT CAAACGGCAT 
SSJagccc TGGATTTAAC GATGGAATTT ATCRGCGTWG GGGCGATCAG TGGGGCTATG 
rrrAATTTCG CGCACGCCCC TTTAGAATTA GAAGAATTAG CGTGCGGATT TTTAGGCTTA 
S^£?£ SStcmSa TCAAGTCATT CAAAGAGACC GCTACGCAGG CTTGCATGCG 
ATCTGGCTCT TTTAG 

(2) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 
{B) LOCATION 1. . .288 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 

TTGTCAGACG CTTCAAAAAG ATCCCTTAAT CCAACCTTAA TGATGAATAA TAATAATACC 60 

CTACCCAAAC CCCTAGAAGA AAGCCTAGAT TTAAAAGAGT TTATCGCTCT TTTTAAAACC 120 

TTTTTCGCAA AAGAAAGAGG TTCTATTGCT TTAGAAAACG ATCTCAAACA GGCTTTCACT 180 

TATTTAAATG AAGTGGATGC GATCGGTTTG CCTGCCCCCM AAAAGCGTGA AAGAAAGCGA 24 0 

TCTTATTGTT GTCAAACTCA CCAAATTAGG GACGCTCCAT TTAGATGA 288 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1. . .243 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 

TTGCCTATTA TTTTAYCTGT AATCGTGATG ATGTTTTTTT CCAAAATCGT TGGCGATTTT 
ATTGAAAAGC ATTATCGCGT CAAAACTTTA GCCTTTGTGT TTTTGCTCGT TGTGGGCGTG 
TTTTTGTTTT TAGAAGGCTT GCATTTACAC ATCAATAAAA ACTATTTGTA TGCGGGTATT 
GGTTTTGCCT TGCTCATAGA ATGCTTGRAT ATTTTCATAG AAAAGAAAAT GAAAAAAAGT 
TAA 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .798 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75 
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ATGATAAAAG CGCGGTTTAA AAAACGCCTT TTAGGATCTA 
ATAGACTTAG AAATTAAAGA AGCAGAAGTT GTCGCTTTAT 
AAAAGCACGA TCTTACGCAT TTTAGCAGGG CTTGAAGCGG 
GCCAATCATT CAGTATGGTT AGACACTCAA AAAAAGATTT 
AAAATCGGCT TTGTGTTTCA AGATTACGCC CTATTCCCTC 
ATCGCCTTTG CTCACCCTAA AGATAAAAAT AAAATCCACG 
TTAGAAAACC TAAGCCAGCA AAAAATTCCC AAACTCTCTG 
GCTTTAGCAA GAGCTTTAAT CGCAGCCAAA AATCTATTGC 
GCCCTAGATA ACGCCTTAAA AAACGAGGTG CAACAAGGTT 
GAAAATTTAA GCGTGTTATT GGTAAGTCAT GATCCAAACG 
ACTTTCCTCT TTTTAAACAA TGGCGTTATT GATCCTAATC 
AACCGCTTAT TGGTAAAACC TCTCTTTGAA GATGAAAATT 
CCTCAAACGA TCAGTTTGCC CAAAGATTGT CTGAACCCAA 
CAAAACAAAA AATTTTAG 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .195 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76 

GTGAAATTCA GCGTTTTAAC CCTTTTCCCG CAACTCATCT 
ATTTTAAAAA GAGCGTTAGA AAAAAACCTT TTTGAATTGG 
TTTAGCGCTA ACAAATATCA AAAAGCGGAK TCACACGCTC 
AATTTTAGAC CCTGA 

(2) INFORMATION FOR SEQ ID NO: 77 : 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 414 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . . 414 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 

TTGTGGCGCA CACCGAAGAC RCCCTTAGTC ATTAAACCCT 
TCAGAGATYT TTGCGGYCAY GTGCGTGGGY ATGGCYAGCG 



GGGGCGCGTT 
TAGGAGAATC 
TGAGTAGCGG 
TTTTAAAACC 
ATTTAAACGT 
AAGTGTTACG 
GCGGGCAAGC 
TTTTAGATGA 
TGCTTGATTT 
AAATAACCAA 
AAGAAAATCG 
ATTGCCATTA 
CTTTTAAGCT 



TGATTTGAAT 
GGGAGCGGGT 
CTATATTGAA 
ACAACAGCGA 
GTATCAAAAC 
CTTAATGCGT 
CCAACGAGTC 
GCCTTTAAAC 
TATCAAGCGT 
ACTCGCGCGA 
GCTTTTTTCA 
TGAGGTCATT 
TGATTTCATT 



TGCCTTATTT TGAAGATTCT 
AAGTGTTAAA CCTTAGAGAT 
ATTGGTGGGG GTGCGGGGCA 
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ATTTGAAAAG CATGAGCGAT 
TTRCGGGGCC TGTGTTAGCC 
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CCGCATCGTT TATGTCCGCT 180 

ACGAAACCAT TTCTAGCCAT 240 

CTAYCGCWMA TGGGGCAAGC 300 

TAGCCTTTGT GGGGATGGTC 360 

TAGGCATGGA GCAT 414 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 48 base pairs 
{B) TYPE: nucleic acid 
{C) STRANDEDNESS: double 
(D) TOPOLOGY: circular 



GGGTATGCGA GCATGGGCAT 
CCTGGGGGGT TGCTGTTCGC 
GCAGATGTTT CTGCAGAAGA 
ACAGGGGYTC ATTTAGCCTT 
GCGCTCGTTA ACGGGCTTTT 



TCCTTTACCT TATTTAATCG 
TAAAACCATT TACCCGCAAA 
GCATGTCAAT ATTATAGAAG 
GCATGTGGGG GCGATGCTTT 
AGGGGTTGTA GGGGGATTTT 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .34 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

GTGATGAACT TTTTTGTGGG CGGACTTTCC ATTGTTTGTA ATGTGGTGGT CATCACTTAC 60 

TCCGCGCTCC ACCCTACAGC CCCTGTAGAA GGTGCAGAAG ATATTGTTCA AGTATCGCAC 120 

CATTTGACCA GTTTCTATGG GCCAGCGACT GGGTTATTGT TTGGKTTTAC CTACTTGTAT 180 

GCCGCTATCA ACCACACTTT TGGTTTGGAT TGGAGACCCT ATTCTTGGTA TAGCTTATTC 240 

GTAGCGATCA ACACTGTTCC TGCTGCGATT TTATCCCACT ATAGCGATAT GCTTGATGAC 300 

CACAAAGTGT TAGGCATCAC TGAAGGCGAT TGGTGGGCAA TCATATKG 348 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_£ eature 

(B) LOCATION 1. . .684 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 

GTGCTTTTGG GCAAGCATAG TGGGGCGGGG TTGTTGAGCG CTTTAWGCGC GTTAAGTTTT 60 

GGATCTGGGG TGGTGAGTAT CCAAGCGTTA GAGTGCGAGA TAACTTCTAA TAACAAGCCT 120 

TTAGAATTGG TTTTTTGTGA AAATTTCCCT AAAAAGCTCA GCGCGTTCGC TCTTGGCATG 180 

GGGTTAGAAA ATATTCCAAA GGATTTTAAG AAGTGGCTTG AATTAGCCCC ATGCGTTTTA 24 0 

GATGCGGGCG TTTTTTATCA TAAAGAAGTG TTACAAGCCT TAGAAAAAGA AGTGATCTTA 300 

ACCCCTCACC CTAAAGAGTT TTTATCGTTA TTGAAATCAG TGGGGATCAA TATAAGCATG 360 

CTAGAATTAC TAGACAATAA ACTAGAAATC GCAAGGGATT TTTCTCAAAA ATACCCCAAG 420 

GTGGTTTTGC TTTTAAAGGG GGCTAATACC CTAATCGCTC ATCAAGGGCG GGTTTTTATC 480 

AACAATTTAG GGAGCGTGGC TTTRGCCAAA GCAGGCAGTG GCGATGTGTT AGCGGGGCTG 54 0 

ATTGTAAGCC TACTTTCTCA AAACTACACG CCTTTAGRCG CCGCSATTAA CGCAAGTTTG 600 
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GCGCACGCCC TAGCGGGTTT ARAATTTAAG AATMATTAMG CTTTAACGCC CSTAGATTTG 660 
ATAGAAAAGR TCAAACGACT ATAA 
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .328 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 80 

GTGCYTTTAT ACCTAGCACT AACCTTGAGT TTAGOCATTG CTATGCTTTT AGTGGAAATG JO 

CTGATTGGAA ATTTGGGTAA AAAAGACGTT GT TTCCAAli CATTCTATCT 180 

AGGAAAAAAT ATTACCCTTT CACTTCTTTT ™TTTAG GCGGCCCTCT CATTCTATCT ^ 

TTTTATGCGG TGGTGTTAGG CTGGGTGCTT TACTATCTTT TTGTAGTAAC 300 

CCTAAAGATT TAGMGCAGGC TAAAATGCAR TTCMGMATGC TTCAAAATGG CAGTTTGATC iUU 
TGGCCTGTTA TTGACTTTAG CGCATGCT 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .294 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 

TTGACAACAA AAGCGTGTTG GTTGCTTCGG GTTTGTTGTT A^GAAGTCT AAATATTACA 60 
ATCAAGGATA GAACGATGAA AACCAATGGT CATTTTAAGG ATTTTCCATG GAAAAAATGC 
TTTTTAGGCG CGAGCGTGGT GGCTTTATTA GTGGGGTGTA ^CCCGCATAT ™TTGAAACC 
AATGAAGTTG CTTTGAAATT GAATTACCAT CCAGCTAGCG AGAAAGTTCA AGCGTTAGAT 
GAAAAGATTT TACTTTTAAG GCCAGCTTTC CAATACAGCG AKAATATTTG CIAA 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobaccer pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .438 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 

TTGAGTGAGT GGCAAACATT TTGTTTAAAA GATTTAGGGA AAATAGTCGG CGGCGCTACC 60 

CCACCTACCA ATAACCCCAA AAATTATGGC AATAAAATTG C"i*TGGATTAC CCCTAAAGAT x20 

TTATCCACTT TACAAGGGCG CTACATTAAA AAAGGCAGCC GCAGCATTTC ACGATTAGGG 180 

TTTAAATCAT GCTCTTGTGT GTTGCTCCCA AAGCATGCCA TTTTATTTTC TTCAAGAGCT 24 0 

CCCATAGGTT ATGTGGCAAT TGCTGAAAAA AGGCTATGCA CCAATCAAGG TTTTAAAAGT 300 

ATTATCCCTA ACAAAAAAAT TTATTTTGAA TTTTTATATT ACTTATTAAA ATACTATAAG 3 60 

GATAACATTT CCAACATAGG GGGCGGAACT ACTTTTAAAG AAGTTTCAGG GGCTACTTTA 420 

GGKTCTATTC CAAGTTAA 43 g 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS : double 
(D) TOPOLOGY: circular 

{ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 
(B> LOCATION 1 . . .822 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 



ATGGAATTTA 
TTGTTAGCCG 
TTGAATAGCA 
GGTCTTACCG 
GCTAATGGGG 
AAGTCGCTAG 
GGGCATGCCG 
TCTTGGGGTG 
GGTATTTTTG 
TGGAAAGAGC 
CCTAATGCCC 
GGGGTGAGAG 
CTCATCAATA 
CGGGATTATT 



TGAAAAAGTT 
AAGGTGATGG 
ATATTTATAA 
CTAATAAGCA 
CTTTGAATGG 
ATATGACAAG 
ATTTAGGTAA 
TGGGGAGCGA 
GTGGGGTCGC 
AAATCATTGA 
CTTATAGCAC 
CCAATATCTA 
AATTTTTGAG 
CGCTTTATTT 



TGTAGCTTTA 
TGTTTATATA 
TACAGGGGAT 
TAATCCAGGA 
TTTTGGGTTG 
CAAGTGGTTT 
ACAAGTTTAT 
TTTGTTAGCT 
TATCGGCGGT 
AGCCAAAGGT 
CAACACTTCA 
CAAGCATAAT 
CGCGGGTCCT 
GGGGTATAAC 



GGGCTTCTAT 
GGGACTAATT 
TGCACAGGGA 
GGCACCAATA 
AATGTGGGTT 
GGTTTTAGAG 
GCACCTAATA 
GATATTATTG 
AACACTTGGA 
CCTGATGTTT 
ACCGTCGCTT 
GGCGTGGAAT 
AACGCTACTA 
TACACTTTTT 



CCGCGGTTTT 
ATCAGCTTGG 
GTGTTGTAGG 
TCAATTGGCA 
ATAAGAAATT 
TGTATGGGCT 
AAATCCAGTT 
ATAAAGACAA 
AAAGCTCTGC 
GTACCCCTAC 
TTCAAGTGTG 
TTGGCGTGAG 
ACCTTTATTA 
AA 



AAGCTCTTCG 
ACAAGCCCGT 
TTGCCCCCCA 
CTCCAAATAC 
CTTCCAATTC 
TTTTGATTAC 
GGATATGGTC 
CGCTTCTTTT 
AGCAAACTAT 
TTATTGTAAC 
GTTGAATTTT 
AGTGCCGCTA 
CCATTTGAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
822 



(2) INFORMATION FOR SEQ ID NO: 84: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc^feature 

(B) LOCATION 1 . . .447 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 

m „-,~-«~~ ^„^^*™ r m -tgcCCCCCC TTGGGGTCTA TATGGATCCA 6C 
I^CCGG^ GCaSaTTCC ttSSSSSSS CATACGCTTC AGAGACTTTG 120 

a£J252S TOCTTCTOT CCCTAAAGAT CAATGGGATT CAAGCTTGAG TTTGGGCTTG 180 
S S "ESS™ GCATGTCATC TTTTTTCAAG CGCTCAAAGT CGCCACGCCA 
StaItSa ^CTTTCAT CAGCCTTTTT AAAGAAACTT CTTTAGCGTC TGTGGTCACT 
^I^rc kttttaSat CGCACAGCAA AAAGYGAACG TCAGCTATGA CTTTYYGCCT 
aSaSgg SSSSS GATTTACTGG CTTTTTTGCT TGGTTTTAGA AGTGATTCAA 
AAGCGCGTGG AAAAAATCTT AAATTAA 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(iii MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1. . .405 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 

r<vrcvcrcTG ATGAAGTTAG GAATTTAGCT GGGCGCACTC AAAAGTCTTT AGCCGAAATC 
JSJSgtgat TGTCCAAGAA ATCAATGATG TGAGTTCGCA aatgaatctc 
AATTCGCAAA AAATGGAGCG CTTGAGCGAT ATGAGTAAAA GCGTGCAAGA AACTTACGAA 
^IItgagt- CTAATTTAAG CTCAGTCGTT TTAGACAGCA ATCAAAGCAT GGACGATTAC 
SJ^tccg SScSKt TGAAGCTATG GTAAGCGATT TTGCAGAAGT ggaaaaagtg 
gS^Sga ctttggctga ttcttcagat ATTTTAAACA TCGCTACGCA tgtgagtgga 
acgaccatga atttakacaa acaagtgaat ttgtttaaaa cttaa 

(2) INFORMATION FOR SEQ ID NO: 86: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



240 
300 
360 
420 
447 



60 
120 
180 
240 
300 
360 
405 
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(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .402 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 

ATGAATTACG ATAACTATTG GGATGAGGAC AAACCAGAAC TCAATATCAC GCCTTTAGTG 60 

GATGTGATGC TTGTTTTATT GGCTATTCTT ATGGTAACGA CGCCCACTCT CACCTATAAA 120 

GAAGAGATTG CCTTGCCTTC TGGTTCAAAA ACTGCTAGAG CCACTCAAGA TAAAGTGATA 180 

GAGATACGCA TGGATAAAGA CGCAAAAATC TATATAGATA GTCAAACCTA TGAATACAMC 240 

TCTTTCCCGG ACACTTTCAA TTTGCTTTCT AAAAAATACG ATAAAGATAC TAGGGTGAGT 300 

ATCCGTGCGG ACAAGCGATT GACCTATGAC AAAGTGATTT ATTTGTTAAA AACGATTAAA 360 

GAAGCGGGGT TTTTGAAAGT T7CTTTAATC ACAAGTCCTT AA 402 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

{iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .216 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 

ATGCCACCCA CACSCCCCCA AGCGAGTATT TTAAGGCTAA CCCTAAAAAA CCCTTTGMGC 60 

MTGCTATCTC GTTATTCGCT CTGTCTGTTG AAAAAAACGC GCTTGCAAAC AACATCAAAC 120 

AGCGCACCAA AAGCATGCTT GATTGCGGGC TTATTGAAGA AATCAAAGCC CTTTATATTA 180 

AATACCCTAA AGATTCGCAG CCTTTTAAAG CCATAG 216 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .654 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 
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TAAGAGTTTT AGTAATGCTT GCAACAATGA TGATGAAATT AGTAAAAACG 60 

SS&^ aSSctttt TAAGAATGTG GGAATATCTA TAATGGGGAT TGCTTTTTGG 120 

gSgcga^a AAGACTCGAT AAAAAAACAA ATTAAAAAAA GCGATTGGAT ATGCGGGAAT 180 

SSSicS CGGATGATTA TTTAAAAACG CATCCTAACT CATGGTTTAA TTCAGCAATA 240 

GGTGTAACAG CGATAACAGC CATGCTTATG AATGTGTGTT TTGCTGATGA CC AATCCAAA 3 00 

SS^Sg ctcaagctca AAAGGAAGCT GAAAACGCTA GGGATAGAGC gaacaagagt 360 

GGGATAGAAC TGGAACAAGA AGAGCAAAAG ACAGAACAAG AAAAACAAAA GACAGAACAA 420 

SSSSaa AGACAGAACA AGAAAAACAA AAGACAGAAC AAGAAAAACA AAAGACAGAA 4 80 

?^^AAAC AAAAGACAAG CAATATAGAG ACTAACAATC AAATAAAAGT AGAACAAGAA 540 

cS^A CAGAACAGGA AAAACAAAAG ACAAACAATA CGCAAAAAGA TTTGGTTAAC 600 

AAAGCAGAAC AAAATTGCCA AGAAAATCAT AATCAATTCT TTATTAAAAA TTAA 654 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs 
(D) T\TC: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . - .228 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 

ATGGTTATTT CTGGGCATTT CACCACTTAT AGCTATATTG AGCCTTTTAT CATTCAAATC 60 

AGCCAATTTT CTCCTGACAT TACAACGCTA ATGTTGTTTG TGTTTGGGTT AGCAGGCGTG 120 

GTGGGGAGTT TTTTGTTCGG CCGTTTGTAT GCGAAAAATT CAAGAAAATT TATCGCTTTT 180 
GCAATGGTTT TAGTCATTTG CCCGCAACCT CTTGCTTTTT GTGTTTAA 

(2) INFORMATION FOR SEQ ID NO: 90: 

(iJ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc.f eature 

(B) LOCATION 1. . .576 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 

ATGAAA^CTA CAAGAATTGG TTCTAAAATT GTCATGATGG TGTGTGCGGT TGTTATTGTC 61 

ATTAGCGCTG TTATGGGCGT TATTATCAGC TACAAGGTTG AAAGCGTGTT GCAAAGCCAA 121 

GCCACAGAAT TGCTGCAAAA AAAAGCTCAG TTAGTCAGTT TTAAAATTCA AGGCATTATG 18< 

AAGCGCATTT TTATGGGCGC TAACACCCTT GAAAGGTTTT TAAGCGATGA AAATGGCGCT 24i 

ATTAATGACA CCCTAAAAAG ACGCATGCTC TCTGAGTTTT TGTTAGCAAA CCCTCATGTG 30 
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TTATTGGTTA GCGCGATTTA TACGAATAAT AATGAACGAA TGATCACTGC AATGAACATG .3 60 

GATTCAAAAA TCGCCTACCC TAATACCGCA CTCAATGAAA ACATGACCMA CCCAATCCAT 42 0 

TCGCTCAAAA GTATAACCCG TTCARATCCC TATTATAAAG AGGTTAATGM CRATAAAATC 48 0 

TATRRCATRR ACATTACCCT CCCCCTAATR RRCAARAATY AAAATRTTAT ARRCRCWCTW 540 

AATTTCKTTT TAAACATTGA CWGCTTTTTA TACTGM 576 



(2) INFORMATION FOR SEQ ID NO: 91: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 
{ B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
ID) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANT I- SENSE: NO 



<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc^f eature 

(B) LOCATION 1. . .762 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 



ATGGCATACA 
TTGGATTTGT 
CTCACAAGCT 
ATCGCTGAAG 
GGAGTCTTAT 
AAGAAAACTG 
CTAGAAGAAA 
ACGGACAATT 
GGCTTTAAAT 
GGGCGTGGTT 
GGCCCTGTTG 
TATAGGGTAA 
GCCAATGAAG 



AATATGATAG 
TCGAGGTGCT 
CCATAGAATA 
AGTTGCAATA 
ACAAAGAGAT 
AAACGACTTT 
TGGATGATGA 
TGAACAGACA 
CTTATCAATT 
TATCGCTTGC 
GCTGGATCAT 
CCATACCGGC 
ATAAGAAGTC 



AG AC TTGG AA 
TGTTTTTGGT 
CAAAAGGCAT 
CTATGGGAGC 
TTTATGCGAT 
AATTGAACAA 
AGAAGTGAAA 
AGCCTTAAGC 
AGCTGTCATT 
GGGCAATCAA 
TACAGGCGTA 
ATGCATTGTG 
GTTGCAAATA 



TTTTTAAAGC 
AAAGACGGCG 
GGCGATGATT 
AATAGTTTTG 
GTGTGCGATA 
AACATGCTTT 
GAAATGTGCG 
GCGGCGACTT 
GTTGCGAATG 
GTGCTTACAA 
TGGACAGCGA 
GTCGCCACTT 
GAATCCGTTT 



AACTGGAATC 
AAAAAAGACA 
ACGCTAAATA 
CGAGTTTCAT 
AATTAAAGGT 
CTAAAATCTT 
ATGAATTGTC 
TAACGCTGTT 
CGGTTGCAAA 
GAACTCTGAG 
TTGATATTGC 
TACGCCTAAA 
AA 



TAGTGATTTA 
CAATGAAAAA 
CGCAGAAAGA 
TAAAGGTGAA 
CAATTACAAC 
AGAAAGAAGC 
CATAAAAAAC 
TAAAATGGGA 
AACCATTCTA 
CTTTTTAACA 
AGGGCCGGCT 
AACGCAACAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
762 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : roisc_f eature 

(B) LOCATION 1. . .882 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 

TTGTTGCTTT TTATCGTTGT GATCACCTCT TTGGTTAAAA ACACGATCCC AAATATTTGG 60 

CTCACTAAAA TCCTTTATAT GGCTATCTTG CTTTGCGCGA TCGCTCATTC TGTGGGGCYA 120 

ATCTTGCGYT GGTATGTGAG TGGGCATTCG CCTTGGAGTA ACGCTTATGA GTCCATGTTC 180 
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TATATCGCA? GGGCTTCTGT TATCGCAGGG TTTGTTTTAC GAMCTAAACT CGCGCTATCG 2 40 

GCTTCTAGCT TTTTGGCCGG TATCGCGCTC TTTGTGGCTC ATTTAGGCTT TATGGACCCT 300 

cSattggcc CTTTAGTGCC GGTGTTAAAA TCCTATTGGC TCAATATCCA TGTCTCTGTC 360 
ATCACCGCTA GTTATGGCTT TTTGGGCTTG TGTTTTGTGC TAGGGATTTT AAGTTTGGTT 
TTGTTTATT^ TGCGCAAACA AGGGCGTTTC AATTTAGACA AAACCATTCT TTCCATTAGC 

SSScStg' 2SgAG«T GATTTTAGGC CTGTTCATGC TCACAGCCGG GAATTTCTTA 540 

GGTOGG^S S£££££k ATCTTGGGGG CGCTATTGGG GGTGGGACCC TA^GAAACT 600 

TCCGCG TTG A TTTCTATTTG CGTCTATGCC TTGATCTTGC ATTTGCGTTT TTTAGGCTCT 660 

SSSSSc ccSXttt AGCGAGCAGC AGCGTGCTAG GGTTTTATTC GGTTTTAATG 720 

S^ATTT^ ggcgtgaatt ACTACCTTTC TGGCTTGCAC AGCTATGCCG CAGGKSATCS 780 

TTTGCCGATC CCTACTTTTT TATACTTTTT GGTAGCGATA CCTTTCGCTC TCGTATCTTG 840 

GCGTATTTCA AACGTCATTT GAGTTTGCCT AAATTGGTTT AA «^ 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 base pairs 

{3} TYPE . nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix> FEATURE: 

(A) NAME /KEY : misc_feature 
<B) LOCATION 1. . .429 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 

c TdC AAATG A TCCACACGCA AGATTACATT AAAATGGAAG AAGCCGCCAC TGAGGCGATT 60 

^SiSS Stcttccat TTACTTGGGC ATGGATATTT TAAAAAATGG GGCTGACGCT 120 

™Stc£g ^ggIgcatag CGGGGCGACT ATGGGTTTAG CGACCTTGCG TTTAGGGCGT 180 

SSESS T?GAAAGGCC TGCTATTTGC ACTTTAATCC CTAGCGTTGG CAAACGCCCT 240 

AGCGTGCTAT TAGACGCAGG AGCGAACACG GATTGCAAGC CTGAATATTT GATTGATTTT 300 

GCTC?CA?GG GG^aSa CGCTAAAAGC GTGTTGCATT ATGACAGCCC TAAAGTGGGT 360 

CTTTTGAGTA ATGGTGAAGA AGATATTAAG GGGGGAATAC GCTCGTTAAA GAAACGCATA 420 
AAATGTTGA 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1 . . .43 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 
ATGCTAGAAA TCAAGAATTT AAACTGCGTT TTAAACTCTC ATTTTTCGCT CCAAAACATC 60 
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AATATTTCTT TAAGTTATAG TGAAAGGGTG GCGATCGTGG GCGAAAGCGG GAGCGGGAAA 
AGCTCTATCG CTAATCTCGT CATGCGATTA AACCCTAGAT TCAAGTCCCA TAATGGCGAA 
ATCCTGTTTG AAACAACCAA TCTTTTAAAG GAAAGCGAAG CGTTYMTGCA GCATTTAAGG 
GGGAATATTA TCGCTTACAT CGCCCAAGAC CCCCTATCCA GCCTCAACCC CTTGCATAAA 
ATCGGCAAGC AAATGAGTGA AGCCTATTTT TTACACCATA AAAACGCTTC TCAAGTGTCT 
CTTAATGAAC AAGTTTTAAA CGTTATGAAA CAAGTTCAAT TGGATGAAAA TTTTTGGAAT 
GTTTCTCTTA TGT 



120 
180 
240 
300 
360 
420 
433 



(2) INFORMATION FOR SEQ ID NO: 95: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
<iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .252 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 



ATGAACTACA AAGTTGCATC TGCTAGAAAT ATCGCAACGC TTCTTTTCTT ATTCTTTTCT 60 

CAAAGTGAAG CTTTTGATTT GGGTAAAATC GCTAAAATCA AAGCGGGTGC TGAAAGTTTC 120 

TCTAAAGTCG GTTTCAATAA CAAACCTATC AACAMTAATA AAGGGATTTA CCCTACCGAA 180 

ACCTTTATGA CGATTAATGG CTTACATGCA GGTGGATTTT ACGGAGCTCT TGCCCAAAAG 240 

CGCTACGGCT AA 252 

(2) INFORMATION FOR SEQ ID NO: 96: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 393 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1 . . .393 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 



ATGGACGCTT TAGAAATCAC CCAAAAACTC ATCAGTTACC CTACCATTAC GCCCAAAGAA 60 

TGCGGTATTT TTGAATACAT TAAATCGCTT TTTCCTGCTT TTAAAACCCT AGAATGTGAA 120 

AAAAATGGCG TGAAAAACCT TTTTTTATAC CGCATTTTTA ACCCCCTCAA AAAGCATGCA 180 

GAAAAAGAAC ATGCAAAAGA AAAGCATGTR AAAGAAAATG TTAWGCCCTT GCATTTTTGC 240 

TTKGCAGGGC ATATTGRTGT CGTGCCTCCT GGGRRCAWTK GGCRRRSKGA TTCCTTTWWA 300 

YCCATCATTA AAGAGGGGTT TTTATACGGT CGTGGGGCGC AAGACATGAA GGGGGGCGTT 3 60 

GGGSSGTTTT TKAGGTGCRR GTTWAAATTT TAA 3 93 



(2) INFORMATION FOR SEQ ID NO: 97: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1023 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : rr.isc._f caturc 

(B) LOCATION 1. . .1023 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .507 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 



ATGATTTTAA 
GACGCCAAGC 
GGCGTTGTGC 
CGCGTTAAAA 
CAGCCAGGTT 
TCTTTGAATT 
ATCAATGAAA 
TTAATTTTAG 
AGCTTTGGGG 
CCCATAGTGG 
ATCCCTTTAA 
CGTTTGGAGG 
TATCATTTTC 
ATCAAACGCC 
AAGGCGTTTG 
TTTTGCAGCG 
CGCTTTATCC 
TGA 



GCATTGAAAG 
TCATCGCTCA 
CTGAGATTGC 
TAAGCTTGAA 
TGAGCGTTAC 
TACCCTTGAT 
AACAAACCCG 
AGGCTAGAGA 
AGAGTTTTGA 
AAAAATTAGC 
AAAACAGCCC 
TTGAAAAAAA 
AAAGCGCGGC 
CTAAAATTTT 
AGGATTTGTG 
ACAATGCCGC 
CTTTAGAAAA 



TTCTTGCGAT 
TTTTAAAATC 
ATCGCGCCTG 
TAAGGATTTT 
TTTAATAGAG 
TTTGGAAGAT 
CATGCCTTTA 
TTATGAAGAC 
TAAGGTTTCA 
CCTTGATTAT 
GAATTTGGCT 
CGCCCATAAT 
TATCGAGCAT 
TGGCATTGTG 
TGCTGAGTTT 
CATGATAGGG 
AGCCGATATT 



GACAGCTCTT TAGCCCTTAC AAGAATAGAG 
TCTCAAGAAA AGCACCACAG CTCTTATGGG 
CATGCTGAGA ATTTGCCGCT TTTATTAGAA 
TCCAAAATTA AAGCCATCGC TATCACTAAT 
GGTTTGATGA TGGCAAAAGC CTTGAGCTTG 
CATTTGAGAG GGCATGTGTA TTCGCTCTTT 
AGCGTGCTGC TAGTCTCTGG GGGGCATTCT 
ATTAAAATCG TTGCCACGAG TTTAGACGAT 
AAAATGCTTG ATTTAGGCTA TCCAGGAGGC 
GCACACCCAA ACGAGCCTTT AATGTTCCCT 
TTTAGTTTTT CAGGTTTAAA AAATGCGGTG 
TTGAACGATG AGGTAAAACA AAAGATTGGC 
TTAATCCAGC AGACTAAACG CTATTTTAAA 
GGGGGAGCGA GCCAAAATCT AGCCTTAAGA 
GATTGCGAGC TTGTTTTAGC CCCTTTAGAA 
CGATCAAGCC TAGAAGCTTA TCAAAAAAAG 
TCGCCAAGAA CGCTGTTAAA AAATTTTGAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1023 




60 
120 
180 
240 
300 
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GAGCGCGATT ACTTAGAAGA AAAAATCATT MYTTRGAAAA CAAWTTTWAA GACATGGGGC 
ATTATGCCGC TAGCGATGAA GTCAACGGAA AAACAGGTTT TGAAAATGTA TCAAGAAGGC 
TATAGCGTGG ATTCTATTTC TAAAGAATTT AAAGTGAGTA AGGGCGAGGT GGAATTTATA 
TTGAACATGG CAGGGTTAAA ATGGTAG 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(ivi ANTI SENSE: MC 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

<ix) FEATURE: 

(A) NAME/ KEY : misc_£eature 

(B) LOCATION 1. . .366 

(xi) SEQUENCE. DESCRIPTION: SEQ ID NO: 99 

TTGGATCCCT TTAGCCATAA GGAGAATTTT TTAGCCGTTG AAACCTTTAA AATGCTAGGC 

AAAACAGAAA GTAAAGACAA TCTTAATTGG ATGATCGCTT TGATCATTGA AAAAGACAAG 

GTCTATGAGC AAGTGGGATC GGTGCGTTTT GTGGTGGTTG TAGCGAGTGC TATCATGGTG 

TTAGCCTTAA TCATAGCGAT CACTCTTTTA ATGCGAGCGA TCGTGAGCAA TCGTTTGGAA 

GTCGTTTCTA GCACCTTGTC TCATTTCTTT AAATTATTGA ACAATCAAKC CCATTCTAGC 

RACAYTAAAT TGGTTRAAGC GCGATCTAAT GACGAATTAG GGCGCAYGCA AACASCTGAT 
YAATAA 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(DJ TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .450 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 

ATGGAATTTT ATCAAGTCTA TGACCCATTA GGCCATATTT GGCTGAGCGC TTTAGTCGCA 
CTTTCGCCTA TTGCGCTCTT TTTTATCTCT CTTATTGTCT TTAAACTTAA AGGGTATAGC 
GCTGGGTTTT TAAGCTTAGC GCTTTCAATC CTTATTGCGT TATTTGTGTA TAAAATGCCT 
GTTCAAATGG TGAGCGCGAG TTTTTTCTAT GGCTTTCTTT ATGGCTTGTG GCCGATCGCA 
TGGATTGTGA TCGCTGCGAT TTTTCTTTAC AACCTTTCAG TGAAGTCCGG GTATTTTGAG 
ATTTTAAAAG AAAGCATTTT AAGCTTGACG CCGGATCATC GCATTTTAGT GATTTTGATC 
GGGTTTTGTT TTGGCTCGTT TTTAGMMGGC GCGRTTGGTT TTGGAGGCCC GGTAGCRATC 
ACAGCGGCGA TTTTAGTGGC CTTGGGCTAA 

(2) INFORMATION FOR SEQ ID NO: 101: 



60 
120 
180 
240 
300 
360 
366 



60 
120 
180 
240 
300 
360 
420 
450 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME /KEY. mi 3 c_ feature 

(B) LOCATION 1 . . .97 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 



ATGAAAAGAA 
CATAATTTGG 
TGGAGGAGCA 
CTCACTAAAA 
ATTAGAAACT 
GGAGGCCCAG 
GCGCCCTATG 
ATCAGCGTAA 
ATCAACATCA 
ACTTTTTGGG 
AAAAGCTTAG 
AAGCATTTTG 
AGCCCTACGG 
AAAATCACCG 
ATAGCCGCTT 
AGAGCGAAGC 
GGGGGATTTC 



TTTTAGTTTC 
AAAGGGTGGA 
AGGAAGTTAG 
GCGCCAATCA 
CTACCGGTAT 
GGCATTCTAA 
TTGAAATTGG 
CTAAGGGTGG 
TCACCAAAGG 
GCAAGTCTGA 
TTAATAACAT 
GAATCCAAGC 
ATATTCAAAA 
CTTTTTTTCA 
ACAATCAAAA 
GATGGGGAGC 
ACTTTTAG 



TTTGGCTGTT 
AGCTTCAGGG 
AAATTATATG 
AAGCATTGAA 
TGGAGCTGTG 
TACGGGAATG 
CACGGTTATT 
GGAGAGCGTG 
CATTCCTACC 
AAACGGGGGC 
GCTTTTTAAC 
TCAAGTCAAT 
TTACATGTTA 
ATATTATAGT 
TCGTTTTCAA 
TGTGTATCAA 



TTGAGTCATA 
GTGGCTAACG 
GGTTCTCGCA 
GAAGCTTTGC 
CCTAGCATTT 
ATTCTAGTCA 
TTTCCTGTAA 
CGTTATGGCC 
AATTGGGAAA 
TTTTTCAATC 
ACCTATTTAA 
TGGCTCAAAG 
GATTCATTGT 
TATTTCTTGA 
AACAACCGCC 
AACTTTTTTG 



GCGCGCATGC 
ATAAAGAAGC 
CGGTGATTTC 
AAAATGTGCC 
CCATTAGGGG 
ATGGGATTCC 
CCTTTCAGTC 
CTAACGCTTT 
GTCAGGTGAG 
AAAATTCTAA 
GAACGGGGGG 
GGCAAGGGTT 
ATCAAATCAA 
CAGACCCTGG 
CCAATAACGA 
GGGACACGGA 



TGTCAAAACT 
GCCTTTAAGC 
TAACAAGCAA 
AGGCGTGCAT 
GTTTGGTGCT 
TATTTATGTC 
TGTGGATAGA 
TGGCGGTGTG 
CGAGAGGACC 
AAACATTGAT 
TATGATGAAT 
TAGATACAAC 
TGATAGCAAT 
ATCTTTAGGC 
TAAAAGCGGG 
TAGGGTAGGG 



(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .759 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 

TTGCGTTCAA TTTCAAGGAT AAAGATGCTT TCAGTGTATG 
AAAAGGGTGC TTGAAGAATG GCTTTTAAGC GAAGACATTT 
GCTTTAGAAA GGGCGGTTTT ACAAAACGCT TCTTTGGGCG 
GGGAGTGGGG ATAATGGAGG TGATGGCTAT ACTCTAGCCA 
AAAACGCTGG TCTTTGAAAT GAAATTAGCA AAAAGCCCCA 
AGGGCTAAAA AAGTAGGGGT AGTCATCAAA GCATGGGAAG 



AAAAAGGGAA 
TAATGGAAAA 
CTAAGGTCAT 
GGCGTTTAGT 
TGTGCCAATT 
AAAAGAATGA 



TGCCCTAGAC 
CGCCGCTATG 
TATTCTTTGT 
GGGGCGTTTT 
GCAAAAAGAA 
AGATTTAGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
978 



6 
12 
18 
24 
30 
36 
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TGCGATGTGT TAGTAGATTG CGTGGTAGGG AGCGCTTTTA AGGGCGGATT AGAGCCGTTT 420 

TTAGATTTTG AAAGCCTTTC TCAAAAAGCA CGCTTTAAAA TCGCTTGCGA CATTCCTAGC 480 

GGGATAGATT CTAAAGGCAG GGTGGATAAG AGGGCGTTTA ARGSCGGATA CCGACTATCA 540 

GCATGGGCGC TATTCAAGTC ATGCTTACTA AGCRATAARR CTAAARACTA TATARRRRAA 600 

TTGAAARTRR RRCATTTARR RGTTTTTAAT CAAATTTATG AGATCCCAAC ARACACTTTT 660 

TTACTMGAAA AAARCGATTT GAARCTGCCC TTAAGGGATA GAAAGAAACG CTCACAAAGG 720 

CGATTACGGG CATGCGCATG TGCTTTTGGG CAAGCATAG 759 

(2) INFORMATION FOR SEQ ID NO: 103: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc.f eature 

(B) LOCATION 1 . . .417 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 

ATGGCATTAG ACAAAAGGAT TTGGATGCAY TTTGATCTTT TGCCTTTTGT GTTTATCATC 60 

CCCTTGTTGG TGGTTTCTTT TTTGTTGATT TTTGAGAGTA GTGCGGTTTT GAGCTTGAAG 120 

CAAGGGGTTT ATTATGCCAT AGGGTTTCTT CTCTTTTGGG TAGTGTTTTT TATCCCTTTC 180 

AGGAAACTCG ATCGGTGGCT CTTTGCGCTT TATTGGGCGT GCGTTATTTT ATTAGCGTTA 240 

GTGGATTTTA TGGGATCGAG CAAGCTTGGA GCGCAGCGAT GGCTAGTCAT TCCTTTCACT 300 

TCTATCACCT TACAGCCTAG CGAGCCTGTG AAAAATCGCY ATTCYTTTAT TGTTGGCGCA 360 

TTTGAKYAAA ATYAACCCGA CCYCCTTTTA AGGGCTATGA TTGGGGCATG TTTTTAA 417 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc__f eature 

(B) LOCATION 1 . . .981 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 

GTGTTAATGG CGTTGARCGA TAAACGCTAC GGCTTAGAAG CAGGGATCAA GTATTTCACC 60 

ATGGGGGCGA TGGCGAGCGC GTTTTTTGCT ATGGGCGCGA TGGCTTTTTA CCTGCTTACA 120 

GGGAGCTTGA ATCTTGAAGT CATTACCCTA TACTTACACA CTGAGGGCAT CACAAACCCC 180 

ATGCTCTTTG CGATGGGCAC TATTTTTTTG ATTGGAGCGA TTGGCTTTAA GGTTTCTTTA 240 

GTGCCTTTCC ATACCTGGAT GCCTGATGTG TATGAGGGYA ATAACCCAGT CTTTGCGAGC 300 

TATATTTCCA TTGTGCCTAA AATCGCTGGC TTTGTGGTAG CGACTCGCCT TTTTGGGGCG 3 60 

TTTATAGACA CTCATACCGC TTGGGTAGAA GACATTTTTT ATGTTTTGAT CCTTATGACT 420 
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ATCACCATCC 
AGTTCTATTT 
CAACAAGCGA 
CTTTTATGGC 
TCTAAATTCA 
GTTTTTGGGC 
GAAAGCGCGT 
GTGGCTGCGT 
ACCCAAAGCT 
TCATTATTGC 



CTAATTTCAT 
CGCATTCTGG 
TGTTTGTTTA 
TCTTAAAAAG 
ACGGCCTTAT 
TTGCAGGGAT 
TAGAGAGCAA 
TTTATTATTT 
ACGCCAAAAC 
CATGGCGTTA 



TGCTTTATGG 
GTTCGCTTTA 
TTGGTTCATG 
CCGGGAAAAA 
CAAAACCCAC 
CCCGCCTTTT 
TCACATTCTT 
CCGTTGGCTC 
GATATTTACA 
G 
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CAAGAAGATG 
GCGTGCGTGT 
TTTGCCTTCA 
ACATGGGATG 
CCCTTAGTGG 
AGCGTGTTTT 
TTAGCGGTGG 
GTGGCGATGT 
CCCAAAACGC 



TCAAAAGGAT 
TTATCCACAC 
CTTACATTGG 
AACGCTACGA 
CGATCTTGGG 
GGGGGAAATT 
TGATGTTAGT 
TTTTCAATAA 
CACCATGCCC 



GCTCGCTTAT 
TGAAGATAGC 
GGCTTTTGGC 
TCACCCCTAT 
CGCTATTTTT 
TTTAGCCGTT 
TAATAGCGCG 
GCCCTTACAA 
ATTTATGCGG 



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY ; circular 

(ii) MOLECULE TYPE: DNA (genomic} 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION 



misc_feature 
1. . .723 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 



ATGATAAACT 
GACAGAAATG 
AAGGCGTTTT 
CAAAAATTGC 
AAGCCTGAAA 
TTAATCAATA 
GACTGGGAAG 
TTGCCCTCTA 
ACTTACGCCT 
TCTYYAAATT 
GGGTGTGTGG 
AAATCCGGCT 
TAG 



CAAAGAAAAG 
GCGCACATTT 
TACAAAAAAA 
AGCTCGCTTA 
CTAAGCGAGC 
ACGCCGGCTT 
TCATGATAGA 
TGATAGAGCA 
ATCCTGGAGG 
TGCGAGCGGA 
CGAAACCGAA 
ATGAAAAACA 



CTTGAAAAAG 
TAGTTGCGGA 
CCATGTGGTT 
CCCTAAGCGT 
GATAGAAACT 
AGCGCTAGGC 
CACGAATATC 
TGACCAAGGG 
GAAKGTCTAT 
CGTGGCTGGC 
AGTCAGCAGG 
CCCCTTACCC 



GGCTTAAGGG 
GCGACTTCAG 
TTTGGCACAG 
TTCATTCCCC 
ATTTTTTCCA 
TTGAACAAGG 
AAGGGGTTGT 
ACTATCATCA 
GGAGCGAGCA 
ACTAACACTA 
GTGCGGGGTA 
CAAACCACAA 



GCTTTTTTAA 
GGTTTGGGCT 
GGAGGCGGCA 
TGTGTTTTGA 
TGACGGATCG 
CTTATGAATG 
TGCATCTCAC 
ATCTTGGTTC 
AGGCGTTKGT 
GAGGGAGAAG 
AAGGCGATAA 
GACAAGGGCT 



AATTTTAAAG 
AGAAATCGCT 
AGAGAATTTA 
TCTTCAAAAC 
CATTGACGCT 
CGAGTTAGAC 
CCGCTTGATC 
TATCGCTGGC 
GAAACAAYYY 
GTGGAACCCG 
ACCAAAGCCC 
AACATCGGGC 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .615 



480 
540 
600 
660 
720 
780 
840 
900 
960 
981 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
723 
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(xii SEQUENCE DESCRIPTION: SEQ ID NO: 106 



GTGAGCGGGG TGGTGTTAAG 
ATCACTTATC AATTGGGCTT 
TTAGACGTGT TTATGCCTGA 
TCGCTCGCTG AAAAAACCGC 
AAGCTCAAAA AAGGGCAATT 
AAATTAGGCT CTATGAGTTC 
GCGCTAAAAG ACACGGATTT 
AATTCCATGA CGAAAAAAGA 
AGGATCGCTT TAGGGARCGG 
GATCAGGCGA GCAAAATGGC 
AATCTAWCGA KCTAG 



CAAATTTGAT AGCGATTCTA AAGGGGGTAT CGCCTTAGGC 
ACCCTTGCGT TTTATTGGGA GTGGGGAAAA AATCCCTGAT 
AAGGATTGTG GGGCGTTTGA TGGGGGCTGG AGATATTATC 
CAGCGTTTTA AACCCTAATG AAGCCAAAGA TTTAAGCAAA 
CACTTTCAAC GATTTTTTAA ACCAAATTGA AAAAGTGAAA 
TCTRATCTCT ATGATTCCAG GTTTAGGGAA TATGGCAAGC 
AGAAAGTTCT TTAGAAGTGA AAAAAATCAA GGCCATGGTT 
GCGAGAAAAC CCCGAGATTT TAAACGGCAG CCGAAGAAAA 
CTTAGAAGYG YCTGAAATCA ATCGCATCAT CAAACGCTTT 
GAAACGACTC ACGAATAAAA AGGGTATTAG CGATYTGATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
615 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

( A i LENGTH: 275 base pai-Ls 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .279 

jxi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 

GTGGAAAAAG CCCATCCGGA TGTGTTTAAC CTCTTGTTAC AGGTTTTAGA TGAGGGGCAT 60 

TTAACCGATA GTAAGGGCGT GAGGGTGGAT TTCAAAAACA CGATTTTGAT TTTAACCAGC 120 

AATGTGGCTA GCGGCGCGCT TTTAGAAGAG GATTTGAGTG AAGCCGATAA ACAAAAAGCG 180 

ATCAAAGAGA GCCTGAGACA ATTCTTCAAG CCGGAATTTT TAAACCGCTT AGATGAAATC 240 

ATCTCCTTTA ACGCCCTAGA TAGTCATGCT ATCATCTAA 279 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc.f eature 

(B) LOCATION 1 . . .246 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 8 

TTGGTGTTTT TAGACAGGCG TTTGATTGTG ATGGTTACGG ACTCTAAAGG GAGTCGTTAT 6 
ATTAATGTGC ATATCTTATT CCGCCAAATC AGTTTGTATG CGCTGTTGAG CGTTGTGGGA 12 
TCTTTATTGT TTTTAGGCGT TTCATTACTG GTTTTAAATA AAGAAATTAA AAACATTGAA 18 
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AAACAGCATG CTTTAMTCAC TAAGGAATTT GAGAAAAAAA GAGAGACGAA TGAAMAGCTT 24 0 

TCTTYG 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

ivi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1. . .702 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 109 
TTGAGTTTGA TGASTGTGTT AAATGCCAAA GAATGCGTTY CGCCCATAAC AAGAAGCGTT 60 

Iagtatcatc agcaaagtgc sgagatcaga gccttgcaat tacaaagtta caaaatggcg 120 
JSggcgc tagacaataa ccttaagctc gttaaagaca aaaagccagc cgtcatcttg 180 

PATTTAGATG AAACCGTTTT GAACACTTTT GATTATGCGG GCTATTTAGT CAAAAACTGC 240 

CCCCAGAAAC TTGGGATAAA TTTGAAAAAG AAGGCTCTCT TACGCTCATT 300 

rrTCGAGCGC TAGACTTTTT AGAATACGCT AATTCTAAGG GCGTTAAGAT TTTTTACATT 360 

SSSSSS 3SK^ TAAGGCATTC ACTTTAAAAA CGCTCAAAAG CTTTAAGCTC 420 

CCCCAAGTGA GTGAAGAATC CGTTTTGTTA AAGGAAAAAG GCAAGCCTAA AGCCGTTAGG 480 

rrrGAGTTAG TCGCTAAGGA TTATGCGATT GTTTTACAAG TGGGCGACAC. TTTGCATGAT 540 

S?!SS5c5S StoctSa AGACGCTAAA AACAGCCAAG AACAACAAGC CAAAGTCTTG 600 

TGGGAAGATG GGCCTATAAA AGCATGGCAA AATAAAAAAT AA 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{ D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .258 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110 
iTrTTfRCGG CTGGTTTGAC TTTGCCTGAA TTTGGGTGTT ATTTAAGCCA TTATCTTTTA 6' 

tggHSgS? Etcaaccg GTCGTTATTT tagaagatga ^TAACGCTA 12 

PARTCTCATT TCATGCAAGC CTTAGAAGAT TGTTTGAAAA GCCCTTTTGA TTTTGTGAGA 18 
?SS?SS SIS TTACCAACGA GACAAAATTC CATGTTTTGC CCAAAGAATT 24 
TGTATTTCCT CCCTTTGA 

(2) INFORMATION FOR SEQ ID NO: 111: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
<iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

\A) NAME/ KEY: mis c_ feature 

(B) LOCATION 1. . .348 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 

TTGATCGCTT TGAGAGTAAC GGCTTGGAAR GTGGYGGCCA TGAAACGCTT GCATTTGAGC 60 

GTGAAAGACG CTGAAAACTT TGATGCGATC CTCAGAGAGA GACCCTTTTT TAAAGATTTG 120 

ATAGAGTTTA TGGTGAGTGG TCCGGTGGTG GTTATGGTTT TAGAAGGCAA AGATGCGGTG 180 

GCTAAAAACA GAGAGCTTAT GGGAGCGACT GATCCCAAAC TCGCCCAAAA AGGTACTATC 240 

AGAGCGGATT TTGCTGAGAG CATTGACGCT AATGCGGTGC ATGGGAGCGA TAGCTTGGAA 3 00 

AACGCGCACA ATGAAATCGC TTTCTTTTTT GCCGCTAGAG AGTTTTAA 348 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .1185 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 112 

TTGATGTGGC TCAAAACGCT TACACTTCAA ACGCTCAATA CCGACAAAGC CTTGCAAGAA 60 

TTTTCTAAAA CGATGGAGGC GTTTAAAACC AAACTCATCC AATCCGCTAA CGATGTGCAT 120 

TCAGAGACTT CTCGCGCCGC TATCGCTAAC GATTTAGAAC GCTTAAAAGA GCATATGATA 180 

AATGTCGCTA ATACCTCCAT AGGGGGGGAA TTTTTATTTG GAGGCAGTAA GGTGGATAGA 240 

CCCCCCATTG ATAGTAATGG GAAATACCAT GGCAATGGCG AAGATTTAAA CGCGCTTATT 300 

AGCTCTGATA ACCTTGTGCC TTATAATATC AGCGGGCAAG ATTTGTTTTT AGGCACCGAT 3 60 

AAAGACAAAC ACAAACTCAT TACCACCAAC ATTAAATTAC TCAATCAAAA CAAGCTCCAM 420 

CCTGATGTGA TGGACGCTTT AGAGCATTCT TCATTGCCTG AAGAAGTTTT CATTAAACCC 480 

AGCGATACCT TGCGAGAACT CATCGGCGAT AACGATAAAA ACCCCACCAA TGACCCTAAA 540 

GAGTTTTTTT ATTTGCAAGG CATTAGGCCT GATGGCTCTA GTTTTAAAGA AAAATTCGCG 600 

TTGGATAAAG CCTATCAAAA CCAAGAAAGC GCGACTAAAG TGAGCGATTT GTTGGATAAA 660 

ATCGGGCATG CTTACGGGAA CACTTCGCAA AATAAAGTCG TGGATGTGAG TTTGAACAAT 720 

TGGGGGCAAA TTGAGATTAA AAACCTAACC CCCGGCAGTG AAAATTTGGA TTTTCATTTG 780 

ATTTCTAGCG ATGGGGATTT TGACGATTTA GACGCCTTGC GTTCGAGCGG TAAAAGGGTT 84 0 

ACTGAATATG TCAAAAGCGC GTTTGTAACG GATAGGAGTT TGAGCCAAGT TAAAGCGGTG 900 

CCTAACATGT ATAACCCTAA AGTGCTTGAA ATCCCTAGCG TGTTTGTGAC TAAAGACAAT 960 

GTTTTAGCTA ACAAAAACAC CAAGTTGAGC GAGATTTTTG GMGATAAGGT GGAAACTTTA 1020 
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AAAATCAACG CCAGCCGTTT GGGCGATGAA AGCGCTATTA AAATCCCAAA CCTCCCTATT 1080 

AATTTGGATA TTCCCATTCT TTTAGATGTG AAAAACTCTA CGATTAAAGA TTTGAAAGAC 1140 

GCGATTAAAG AACGCTTCAA TAATGAAGGT GGATGTGGAA ATTGA U8b 

(2) INFORMATION FOR SEQ ID NO: 113: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1 . . .309 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 

TTGAAGGCAT TAAACGACTG CATGGTATTT TTTCATAAGA AAATTATTTT AAATTTTATC 60 

TATTCTTTAA TGGTTGCTTT TTTATTCCAT TTATCCTATG GGGTTCTTTT AAAAGCCGAT 120 

GGAATGGCTA AAAAGCAAAC TCTTTTAGTG GGTGAAAGGC TTGTGTGGGA TAAGCTCACG 180 

CTGTTAGGGT TTTTAGAAAA AAACCATATC CCCCAAAAAC TCTACTACAA TTTGAGCTCT 240 

CAAGATAAAG AATTGAGTGC TGAAATCCAA AGCAATGTTA CCTACTACMA CTTTAAGAGA 3 00 
TGCAAATAA 



(2) INFORMATION FOR SEQ ID NO: 114: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1092 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .1092 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 

ATGAAATTTT TTCTTTTAAA GAAATTCAGC RAATTTTTAA ACACTCAAAC GCATTTTAAC 60 

CTCAAACGCT TGAACGCGTC TAGTTTTTTA TTAGAGACTT TTTCTAAAGA AAAACACGCC 120 

TTTGTTGTGG ATTTGAGCGC GCCTTATATT GGTTTGTCCA AAAAACCCCC AGAGAGCGTT 180 

TTAAAAAACA CTTTAGCGTT AGATTTTTGT TTGAATAAAT TCACCAAAAA CGCCAAAATT 2 40 

TTACAAGCAA ACGTCATTGA TAACGATCGG ATTTTAGAAA TCAAGGGCGC TAAAGATTTA 3 00 

GCTTATAAGA GTGAAACTTT TATTTTGCGT TTAGAAATGA TCCCTAAAAA AGCCAATCTC 3 60 

ATGATTTTAG ATCAAGAAAA ATGCGTGATA GAGGCTTTTC GTTTTAATGA CAGGGTGGCT 420 

AAAAACGATA TTTTAGGGGC ATTGCCTCCT AATATTTACG AGCATCAAGA AGAGGATTTG 480 

GATTTTAAGG GATTGTTGGA CATTTTAGAA AAAGATTTTT TATCCTATCA GCACAAAGAA 540 

TTGGAACACA AAAAAAATCA AATCATCAAG CGATTAAACG CCCAAAAAGA ACGCTTGAAA 600 

GAAAAATTAG AAAAACTAGA AGATCCTAAA ACTTTACAGC TGGAAGCGAA AGAATTGCAA 660 

ACTCAAGCCT CATTGTTGCT CACTTACCAG CATTTAATCA ACAGGCGTGA AAATCGCGTG 720 
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ATTTTAAAGG ATTTTGAAGA TAAAGAATGC ATGATTGAAA TTGATAAGAG CATGCCCTTA 7 80 

AACGCTTTTA TCAATAAAAA ATTCACTCTC AGCAAGAAAA AGAAACAAAA ATCGCAATTC 840 

TTGTATTTAG AAGAAGAGAA TCTGAAAGAA AAAATCGCTT TTAAGGAAAA TCAAATCAAC 900 

TATGTTAGAG ACGCTGCAGA AGAAAGCGTT TTAGAAATGT TTATGCCGGT AAAAAACTCT 960 

AAAATCAAAC GCCCGATGAA CGGGTATGAA GTGCTGTATT ATAAGGATTT WAAAWCGGGT 1020 

TTRGGAAAAA CCAAAAAGAG AATATCAAGC TTTTACAAGA CGCAARARCG AATGATTTKT 1080 

GGATGCAYKT GA 1092 

(2) INFORMATION FOR SEQ ID NO: 115: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



till) HYPOTHETICAL: ¥E£ 



<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1. . .172 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 



Met 


Lys 


Gly 


Pro 


He 


Leu 


Trp 


Pro 


Ala 


Phe 


Ser 


Gin 


Phe 


Ser 


Asp 


Gin 


1 








5 










10 










15 




Asp 


Leu 


Ser 


Asp 


He 


Val 


Ala 


Tyr 


Leu 


Thr 


Ser 


He 


Leu 


Pro 


Lys 


Asn 








20 










25 










30 




Leu 


Ser 


Asp 


Lys 


Glu 


Val 


Phe 


Ala 


Gin 


Ser 


Cys 


Gin 


Arg 


Cys 


His 


Ser 






35 










40 










45 






Leu 


Asp 


Tyr 


Ala 


Lys 


Asp 


Lys 


Ala 


Phe 


Ser 


Asp 


Pro 


Lys 


Asp 


Leu 


Ala 




50 










55 










60 








Asn 


Tyr 


Leu 


Gly 


Ser 


His 


Ala 


Pro 


Asp 


Leu 


Ser 


Met 


Met 


He 


Arg 


Ala 


65 










70 










75 








80 


Lys 


Gly Glu 


His 


Gly 


Leu 


Asn 


Val 


Phe 


lie 


Asn 


Asp 


Pro 


Gin 


Lys 


Leu 










85 










90 








95 




Leu 


Pro 


Gly 


Thr 
100 


Ala 


Met 


Pro 


Arg 


Val 
105 


Gly 


Leu 


Asn 


Glu 


Lys 
110 


Ala 


Gin 


Lys 


Gin 


Val 


He 


Ser 


Tyr 


Leu 


Glu 


Lys 


Ala 


Gly 


Asp 


Arg 


Lys 


Lys 


His 






115 










120 










125 






Glu 


Arg 
130 


Asn 


Thr 


Leu 


Gly 


He 
135 


Lys 


He 


Met 


He 


Phe 
140 


Phe 


Ala 


Val 


Leu 


Ser 


Phe 


Leu 


Ala 


Tyr 


Ala 


Gly 


Lys 


Glu 


Lys 


Phe 


Gly Ala 


Lys 


Cys 


He 


145 










150 










155 






160 


Lys 


Phe 


Lys 


Lys 


Gly 
165 


Gly 


Thr 


Trp 


Phe 


Tyr 
170 


Asp 


Phe 











(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .61 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 

Met Gin Glu Phe Ser Leu Trp Cys Asp Phe lie Glu Arg Asp Phe Leu 

15 10 15 

Glu Asn Asp Phe Leu Lys Leu lie Asn Lys Gly Ala He Cys Gly Xaa 

20 25 30 

Thr Ser Asn Pro Ser Leu Phe Cys Glu Ala He Thr Lys Ser Ala Phe 

35 40 45 

Tyr Gin Asp Glu He Ala Lys Xaa Gin Arg Gin Lys Ser 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 86 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1. . .286 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117 



Leu 


Xaa 


Pro 


Met 


Lys 


Val 


He 


Gin 


Val 


Phe 


Leu 


Phe 


Ser 


Asn 


Pro 


Phe 


1 








5 










10 










15 




Cys 


Ala 


He 


Val 


Pro 


Asn 


Thr 


Glu 


Pro 


Glu 


His 


Leu 


Glu 


His 


Tyr 


Asp 






20 










25 










30 






His 


Asp 


Leu 


Glu 


Arg 


Phe 


Phe 


Phe 


Ala 


Tyr 


Lys 


Tyr 


Phe 


Leu 


Asp 


His 




35 








40 










45 








Ala 


Gin 


Lys 


Arg 


Val 


He 


Tyr 


Lys 


Glu 


Asp 


Pro 


Phe 


Leu 


Lys 


Asn 


Tyr 




50 






55 










60 






He 


Gin 


Ser 


Lys 


Asp 


Ala 


He 


Val 


Leu 


Glu 


Lys 


Lys 


Asp 


He 


Tyr 


Asn 


65 






70 










75 










80 


Tyr 


He 


Leu 


Lys 


Asp 


Gly 


Glu 


Pro 


Tyr 


Thr 


Ser 


Phe 


Glu 


Leu 


Lys 


Asn 








85 










90 










95 




Leu 


Gly 


Ala 


Phe 


Leu 


Val 


Trp 


Gly 


Leu 


Gly 


Glu 


His 


Asn 


Ala 


Thr 


Asn 






100 










105 










110 




Glu 


Ala 


Ser 


Leu 


Ala 


He 


Leu 


Ser 


Ala 


Leu 


Asp 


Glu 


Leu 


Asn 


Leu 


Glu 






115 










120 










125 








He 


Arg 


Asn 


Asn 


Xaa 


Leu 


Asn 


Phe 


Lys 


Gly 


He 


Lys 


Lys 


Arg 


Phe 


Asp 




130 










135 










140 






His 


His 


He 


Leu 


Gin 


Lys 


Asn 


Asn 


Leu 


He 


Leu 


He 


Asp 


Asp 


Tyr 


Ala 


145 








150 










155 










160 


Pro 


Thr 


Glu 


lie 


Gly 


Xaa 


Thr 


Leu 


Lys 


Ser 


Ala 


Arg 


lie 


Tyr 


Ala 


Asn 










165 










170 










175 




Leu 


Leu 


Asn 


Thr 


Gin 


Glu 


Lys 


lie 


He 


Val 


He 


Trp 


Gin 


Ala 


His 


Lys 








180 










185 










190 






Tyr 


Ser 


Arg 


Leu 


Met 


Asp 


Asn 


Leu 


Glu 


Glu 


Phe 


Lys 


Lys 


Cys 


Phe 


Leu 




195 










200 










205 








Glu 


His 


Cys 


Asp 


Arg 


Leu 


He 


He 


Leu 


Pro 


Val 


Tyr 


Ser 


Ala 


Ser 


Glu 




210 








215 










220 










Val 


Lys 


Arg 


Asp 


lie 


Asp 


Leu 


Lys 


Ala 


His 


Phe 


Lys 


His 


Tyr 


Asn 


Pro 


225 




230 










235 










240 


Thr 


Phe 


He 


Asp 


Arg 


Val 


Arg 


Lys 


Lys 


Gly Asp 


Phe 


Leu 


Glu 


Leu 


Leu 








245 










250 










255 




Val 


Asn 


Asp 


Asn 


Val 


Val 


Glu 


Thr 


lie 


Glu 


Lys 


Gly 


Phe 


Val 


He 


Gly 






260 










265 










270 






Phe 


Gly 


Ala 


Gly 


Asp 


He 


Thr 


Tyr 


Gin 


Leu 


Arg Gly Glu 


Met 










275 






280 










285 








(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 118: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION 1 . . .61 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 

Leu Leu Leu Phe Phe Leu Leu Lys Gly Val Val Phe Ser Leu Gly Phe 

1 5 10 15 

Phe Ser Phe Phe Glu Glu Val Ser Gly Ser Phe Xaa Ala Val Ser Leu 

20 25 30 

Xaa Val Leu Ala Leu Val Met Gly Ser Ser Xaa Gly Leu Glu Glu Phe 

35 40 45 

Cys Val Leu Glu Glu Leu lie Asn Ser Gly Leu Ser Val 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 119: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc.feature 

(B) LOCATION 1 . . .122 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 



Met 


Gly 


Phe 


Leu 


Lys 


Val 


Leu 


Lys 


His 


Asp 


Ala 


Leu Gly Gin Val 


Gly 


1 






5 










10 










15 




Asn 


He 


Val 


He 


Gly 


Asn 


Phe 


Leu 


He 


Thr 


Leu 


Thr 


Val 


Leu 


Ala 


Val 








20 








25 










30 






Cys 


Phe 


Ser 


Ser 


Gin 


Ser 


Ala 


Glu 


Glu 


Thr 


Thr 


Met 


Leu 


Thr 


Leu 


Ser 




35 










40 










45 








Tyr 


Thr 


Leu 


Phe 


Phe 


He 


Leu 


Gly 


Ala 


Phe 


Leu 


Leu 


Val 


Ala 


He 


Ser 


50 










55 










60 










Val 


Gly Ala 


He 


Lys 


Asn 


Leu 


Asn 


Ala 


Leu 


Phe 


Ser 


Lys 


Arg 


Gly Val 


65 








70 










75 










80 


Leu 


Ser 


Phe 


Ser 


Leu 
85 


Pro 


He 


Ser 


Leu 


Glu 
90 


Ser 


Leu 


Leu 


Leu 


Pro 
95 


Lys 


He 


Leu 


Leu 


Pro 
100 


Xaa 


Val 


Phe 


Phe 


Tyr 
105 


Leu 


Gin 


Phe 


Val 


Leu 
110 


Val 


Cys 


Gly 


Glu 


Arg 
115 


Ala 


Phe 


Gly 


Leu 


Leu 
120 


Pro 


Phe 















(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

Ui) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME /KEY : misc_feature 
(B> LOCATION 1 ... 187 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120 

Met Leu Lys Thr His Leu Ser Ser Ala Arg Gly Val Val Val Leu Ser 



lu 



Lys lie Leu Pro Val Asn Val Val Leu Met Val Ser Val. Arg Leu Phe 

Glu Lys Glu llu Lys Arg Lys Pro Tyr Tyr He lie Ala Ser Ala His 

Ser Asp Glu Gly Leu Glu Lys Leu Lys Lys Xaa Gly Xaa Asp Met Val 

50 55 

Xaa Xaa Pro Thr Lys Leu Met Ala Gin Arg Val Ser Ala Asn Xaa Trp 

65 70 75 , 

Cys Xaa Leu Asp Met Glu Asn lie Leu Glu Arg Phe He Asn Lys Lys 

85 90 
Asp Thr Leu Leu Asp Leu Glu Glu Val lie Val Pro Lys Thr Ser Trp 

100 105 , 

Leu Val Leu Arg Lys Leu Lys Glu Ala His Phe Arg Glu lie Ala Lys 

115 120 
Ala Phe Val He Gly lie Thr Gin Lys Asp Gly Lys Tyr lie Pro Met 

130 135 
Pro Asp Gly Glu Thr lie lie Ala Ser Glu Ser Lys Leu Leu Met Val 
145 150 155 lou 

Gly Thr Ser Glu Gly Val Ala Thr Cys Lys Gin Leu lie Thr Ser His 

165 170 
Gin Lys Pro Lys Glu Val Asp Tyr He Ser Leu 
180 185 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME /KEY : misc„f eature 

(B) LOCATION 1. . .193 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 

Val Gly Ser Phe Leu Phe Val Gly Pro Ser Gly Val Gly Lys Thr Glu 

Leu Ala Lys Glu Leu Ala Leu Asn Leu Xaa Leu His Phe Glu Arg Phe 

Asp Met Ser llu Tyr Lys Glu Ala His Ser Val Ala Lys Leu He Gly 

Ser Pro Ser Gly Tyr Val Gly IL Glu Gin Gly Gly Leu Leu Val Asn 
50 55 60 
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Ala 


He 


Lys 


Lys 


His 


Pro 


His 




65 










70 




Lys 


Ala 


His 


Pro 


Asn 


Val 


Tyr 


Asp 










85 






Ala 


Thr 


Leu 


Ser 


Asp 


Asn 


Leu 


Glv 








100 








He 


Leu 


He 
115 


Met 


Thr 


Xaa 


Xaa 


Val 
120 


Gly 


Phe 


Phe 


Ser 


Thr 


Lys 


Asn 


Ala 




130 








135 




Leu 


Leu 


Thr 


Pro 


Glu 


Leu 


Arg 


Ser 


145 










150 




Asn 


Ala 


Leu 


Ser 


Leu 


Glu 


Asp 


Phe 










165 






Gly 


Arg 


Val 


Lys 


Ser 


Pro 


Ser 


Thr 



180 



Pro 



134 



Leu 




Leu 


Leu 


Asp 


Glu 


lie 


Glu 






"7 c: 










80 


Leu 


Leu 


T ni% 


Gin 


Val 


Met 


Xaa 


Asn 




o n 










95 




Asn 


Lys 


ji- 

nld 




Pne 


Lys 


His 


Val 


1 fit; 










110 






Gly 


Ser 


Lys 


Asp 


Lys 


Asp 


Thr 


Leu 










125 








Lys 


Tyr 


Asp 


Arg 


Ala 


Val 


Lys 


Glu 








140 








Arg 


He 


Asp 


Ala 


He 


Val 


Pro 


Phe 






155 










160 


Glu 


Thr 


His 


Cys 


Phe 


Cys 


Gly 


He 




170 










175 




Arg 


Ala 


Arg 


Arg 


Asp 


Leu 


Lys 


He 


185 










190 





(2) INFORMATION FOR SEQ ID NO: 122: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids 

(B) TYPE: a^ino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1. . .303 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 

Met Ala Phe Gin Val 
1 5 
Gin Ser Ala Leu Thr 
20 

Ser Ser Gly Leu Arg 
35 

Thr Val Ala Asp Ser 
50 

He Ala Asn Thr Asn 
65 

Ala Met Asp Glu Gin 
85 

Thr Gin Ala Ala Gin 
100 

Gin Ser Asp He Val 
115 

Thr Thr Thr Tyr Asn 
130 

Lys Glu Phe Gin Val 
145 

lie Gly Ser Thr Thr 
165 

Gly Ala Leu He Thr 
180 

Val Asp Gly Val Asn 
195 

Ser Ala Gly Thr Gly 
210 

Ser Asn Arg Thr Gly 
225 



Asn Thr 


Asn 


lie 


Asn 


Ala 


Met 


Asn 


Ala 


His 


Val 


Gin Asn 






10 










15 




Ala 


Leu 


Lys 


Thr 


Ser 


Leu 


Glu 


Arg 


Leu 


He Asn 




25 










30 




Lys 


Ala 


Ala 


Asp 


Asp 


Ala 


Ser Gly Met 




40 










45 








Leu Arg 


Ser 


Gin 


Ala 


Ser 


Ser 


Leu 


Gly 


Gin 


Ala 


55 










60 








Asp Gly Met Gly 


He 


lie 


Gin 


Val 


Ala 


Asp 


Lys 


70 








75 








80 


Leu Lys 


He 


Leu 


Asp 
90 


Thr 


Val 


Lys 


Val 


Lys 
95 


Ala 


Asp Gly Gin Thr 


Thr 


Glu 


Ser 


Arg 


Lys 


Ala 


He 






105 








110 






Arg Leu 


lie 


Gin 


Gly 


Leu 


Asp 


Asn 


He 


Gly Asn 




120 










125 








Gly Gin 


Ala 


Leu 


Leu 


Ser 


Gly 


Gin 


Phe 


Thr 


Asn 


135 










140 








Gly Ala 


Tyr 


Ser 


Asn 


Gin 


Ser 


lie 


Lys 


Ala 


Ser 


150 








155 








160 


Ser Asp 


Lys 


He 


Gly 


Gin 


Val 


Arg 


lie 


Ala 


Thr 


Ala Ser 






170 










175 




Gly Asp 


lie 


Ser 


Leu 


Thr 


Phe 


Lys 


Gin 






185 










190 




Asp Val 


Thr 


Leu 


Glu 


Ser 


Val 


Lys 


Val 


Ser 


Ser 




200 










205 






He Gly 


Val 


Leu 


Ala 


Glu 


Val 


He 


Asn 


Lys 


Asn 


215 










220 








Val Lys 


Ala 


Tyr 


Ala 


Ser 


Val 


lie 


Thr 


Thr 


Ser 


230 








235 










240 
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„sp Val Ala Val Gin Ser Gly Ser Leu Ser Asn Leu Thr Leu Asn Gly 



lie His Leu Gly III lie Ala Asp lie Lys Xaa Asn Asp Ser Asp Gly 



265 



Arg Leu Val Tnr Ala He Asn Ala Val Thr Ser Glu Thr Gly Val Xaa 

2 80 



Ala Tyr Thr Asp Gin Lys Gly Arg Leu Asn Leu Arg Ser lie Gly 



290 



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .161 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:123 

Met Phe Phe Lys Thr Tyr Gin Lys Leu Leu Gly Ala Ser Cys Leu Ala 

leu Tyr Leu Val Gly Cys Gly Asn Gly Gly Gly Gly Glu Ser Pro Val 

Glu Met He Xaa Asn Ser Glu Gly Thr Phe Gin lie Asp Ser Lys Ala 

ASP Ser ne Thr lie Gin Gly val Lys Leu Asn Arg Gly Asn Cys Ala 

Val lln Phe Val Pro Val llr Glu Thr Phe Gin Met Gly Val Leu Ser 

Sn Val Thr Pro He Ser lie Gin Asp Phe Lys Asp Met Ala Ser Thr 

Tyr Lys lie Phe Asp Gin Lys Lys Gly Leu Ala Asn He Ala Asn Lys 

He Ser Gin ITu Glu Gin Lys Gly Val Met Met Lys Pro Xaa Thr Leu 
120 



100 i° 5 

' Met Met 

Ser Gin 

i30 135 "0 



Asn Phe III Glu Ser Leu Lys Gly He Ser Gin Gly Cys Asn lie lie 



Glu III Glu lie Gin Thr Asp Lys Gly Ala Trp Thr Phe Asn Phe Asp 

145 150 
Lys 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION .1 . . .91 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 



Val 


Cys 


Leu 


Gly 


Leu 


Ala 


Asp 


Val 


Met 


Val 


Val 


Leu 


Ser 


Leu 


His 


Leu 


1 








5 










10 










15 




Asn 


Leu 


Asn 


Pro 


Thr 


Asn 


Pro 


Lys 


Trp 


Leu 


Asn 


Arg 


Asp 


Arg 


Leu 


Val 








20 










25 










30 






Phe 


Ser 


Gly 


Gly 


His 


Ala 


Ser 


Ala 


Leu 


Val 


Tyr 


Ser 


Leu 


Leu 


His 


Leu 






35 










40 








45 








Trp 


Gly 


Phe 


Asp 


Leu 


Ser 


Leu 


Asp 


Asp 


Leu 


Lys 


Arg 


Phe 


Arg 


Gin 


Leu 




50 










55 










60 








His 


Ser 


Lys 


Thr 


Pro 


Gly 


His 


Pro 


Glu 


Leu 


His 


His 


Thr 


Glu 


Gly 


He 


65 










70 










75 








80 


Glu 


lie 


Thr 


Thr 


Xaa 


Phe 


Arg 


Ala 


Arg 


Phe 


Cys 











85 90 



(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1. . .187 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125 



Met 


Thr Thr 


Pro 


Met 


He 


He 


He 


Ser 


Leu 


Glu 


Met 


Gly 


Leu 


Ser 


Leu 


1 






5 










10 








15 




Val 


Pro Met 


Arg 
20 


Gin 


Cys 


Leu 


Val 


Cys 
25 


Gin 


Ala 


Leu 


Ala 


Arg 
30 


Ser 


He 


Ser 


Trp Asn 


Gly 


Leu 


Gly 


Gly 


Asn 


Val 


Arg 


Asn 


Thr 


Lys 


Val 


Tyr 


Gly 




35 










40 










45 




Lys 


Phe Ala 


Ala 


Tyr 


His 


His 


Leu 


Gin 


Lys 


Tyr 


Leu 


Leu 


He 


Asp 


Leu 


He 


50 








55 








60 








Ala Arg 


Phe 


Lys 


Thr 


Gin 


Gly Gly 


Tyr 


He 


Phe 


Arg 


Tyr 


Asn 


Thr 


65 








70 










75 






80 


Asp Asp Tyr 


Leu 


Pro 


Leu 


Asn 


Ser 


Thr 


Phe 


Tyr 


Met 


Gly Gly Val 


Thr 


Thr 


Val Arg 




85 










90 








95 




Gly 


Phe 


Arg 


Asn 


Gly 


Ser 


He 


Thr 


Pro 


Lys 


Asp Glu 


Phe 


Gly 




100 










105 








110 






Leu Trp 
115 


Leu 


Gly 


Gly 


Asp 


Gly 
120 


He 


Phe 


Thr 


Xaa 


Ser 
125 


Thr 


Glu 


Leu 


Ser 


Tyr Gly 


Val 


Leu 


Lys 


Ala 


Ala 


Lys 


Met 


Arg 


Leu 


Ala 


Trp 


Phe 


Phe 




130 








135 










140 








Asp 


Phe Gly 


Phe 


Leu 


Thr 


Phe 


Xaa 


Thr 


Pro 


Thr 


Arg 


Gly 


Ser 


Phe 


Phe 


145 








150 










155 






160 


Tyr 


Asn Ala 


Xaa 


Thr 


Thr 


Thr 


Ala 


Asn 


Phe 


Lys 


Asp 


Tyr 


Xaa 


Val 


Val 








165 










170 








175 




Gly 


Xaa Xaa 


Phe 


Glu 


Xaa 


Ala 


Thr 


Trp 


Arg 


Ala 













180 185 
(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(iii} HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . . 104 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 126 

Thr Lys Leu Glr 
15 

tie He Phe Il< 
20 25 30 

x Lys Phe Val Pro Tyr Glu Ala Glu 
35 4 0 45 



Lys 


Ser 


Leu 


Leu 


Glu 


Val 


5 










10 


Leu 


Met 


His 


He 


Ala 


He 










25 




Lys 


Phe 


Val 


Pro 


Tyr 


Glu 












Ser 


Pro 


Phe 


Phe 


Ser 


Phe 






55 








Gin 


His 


Lys 


Met 


Ser 


Glu 




70 










Asn 


Pro 


Lys 


He 


Val 


Glu 


85 










90 


Ser 


Phe 


Ser 


Gly 







50 55 60 

i Tyr Lys Gin His Lys Met Ser Glu Ser Gin Ser Met Gin Gil 
€5 70 75 80 

, m-* r.in &en A<=n Pro Lvs He Val Glu Asn Lys Xaa Trp His Lys 

95 

Ser Phe Ser Gly 
100 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc__f eature 

(B) LOCATION 1. . .182 





Cxi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 127 










Met 


Gin 


Phe 


Glu 


Glu 


Met 


Lys 


Glu 


Leu 


Ala 


His 


Gin 


He Gly Val 


Phe 


1 








5 








10 










15 




Tyr 


His 


Val 


Gly 


Val 


Asp 


Gly 


He 


Ala 


Leu 


Phe 


Leu 


Leu 


Leu 


Leu 


Asn 






20 










25 










30 




Lys 


Ala 


He 


Val 


Val 


Leu 


Leu 


Ser 


Val 


Val 


Tyr 


Val 


Lys 


Glu 


Arg 


Arg 






35 










40 










45 




Gly 


Val 


Asp 


Phe 


Val 


He 


Cys 


Leu 


Leu 


Leu 


Leu 


Xaa 


Gly 


He 


Leu 


Met 


50 










55 










60 




Glu 


He 


Ser 


Phe 


Ser 


Ser 


Leu 


Asn 


Val 


He 


Phe 


Phe 


Tyr 


Ala 


Phe 


Trp 


65 








70 










75 










80 


Leu 


Leu 


Pro 


Val 


Leu 


Tyr 


Leu 


He 


Gly 


Arg 


Phe 


Gly Arg 


Asn 


Asn 


Lys 








85 








90 








Ala 


95 




He 


Tyr 


Ser 


Gly 


Met 


Lys 


Phe 


Phe 


Leu 


Tyr 


Thr 


Phe 


Leu 


Ser 


Leu 






100 










105 










110 






Cys 


Met 


Leu 


Leu 


Gly 


He 


Leu 


Tyr 


He 


Gly 


Tyr 


Asp 


Tyr 


Ala 


Asn 


Asn 




115 










120 










125 






Phe 


Tyr 


Gly 


Met 


Met 


Ser 


Phe 


Asp 


He 


Leu 


Asp 


Trp 


Tyr 


Gin 


Leu 


Asn 


130 










135 










140 








He 


Ser 


Ser 


Gly 


He 


Lys 


Thr 


Trp 


Leu 


Phe 


Val 


Ala 


Phe 


Leu 


He Gly 


145 






150 










155 






Leu 


Met 


160 


Ala 


Val 


Lys 


He 


Pro 


Leu 


Phe 


Pro 


Phe 


Thr 


His 


Gly 


Cys 


Arg 






165 










170 










175 
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He Leu Thr Pro Pro Leu 
180 



(2) INFORMATION FOR SEQ ID NO: 128: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

{ i X ) FEATURE : 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1 . . .116 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:128 



Val 


Lys 


Lys 


Tyr 


Ala 


Glu 


Asp 


Phe 


He 


Thr 


Lys 


Asp Glu 


Val 


Lys 


Ser 


1 








5 










10 










15 




Leu 


Leu 


Glu 


Arg 
20 


Leu 


Ala 


Lys 


Asp 


Tyr 
25 


Pro 


Thr 


He 


Val 


Glu 
30 


Glu 


Ser 


Lys 


Lys 


He 
35 


Pro 


Thr 


Gly 


Ala 


He 
40 


Arg 


Ser 


Val 


Leu 


Gin 
45 


Ala 


Leu 


Leu 


His 


Glu 
50 


Lys 


He 


Pro 


He 


Lys 
55 


Asp 


Met 


Leu 


Thr 


He 
60 


Leu 


Glu 


Thr 


lie 


Thr 


Asp 


He 


Ala 


Pro 


Leu 


Val 


Gin 


Asn 


Asp 


Val 


Asn 


He 


Leu 


Thr 


Glu 


65 










70 








75 










80 


Gin 


Val 


Arg 


Ala 


Arg 
85 


Leu 


Ser 


Arg 


Val 


He 
90 


Thr 


Asn 


Ala 


Phe 


Lys 
95 


Ser 


Glu 


Asp 


Gly 


Arg 


Leu 


Lys 


Phe 


Leu 


Thr 


Phe 


Ser 


Thr 


Asp 


Xaa 


Glu 


Gin 








100 










105 








110 






Phe 


Xaa 


Ala 
115 


Gin 



























(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

<A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .240 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129 

Met Met Lys Asn Lys Arg Ser Gin Asn Ser Pro Tyr Val Thr Pro Asp 

15 10 15 

Asn Pro Tyr Leu Thr Leu Glu Lys Ala Leu Gly Tyr Ser Phe Lys Asp 

20 25 30 

Lys Arg Leu Leu Glu Gin Ala Leu Thr His Lys Ser Cys Lys Leu Ala 

35 40 45 

Leu Asn Asn Glu Arg Leu Glu Phe Leu Gly Asp Ala Val Leu Gly Leu 

50 55 60 

Val lie Gly Glu Leu Leu Tyr His Lys Phe Xaa Xaa Xaa Asp Gly Gly 
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65 70 75 BO 

Lvs Leu Ser Lys Leu Arg Ala Ser lie Val Ser Ala Hxs Gly Phe Thr 

85 90 95 

Lys Leu Ala Lys Ala lie Ala Leu Gin Asp Tyr Leu Arg Val Ser Ser 

100 105 HO 

Ser Glu Glu lie Ser Lys Gly Arg Glu Lys Pro Ser He Leu Ser Ser 

115 120 125 

Ala Phe Glu Ala Leu Met Ala Gly Val Tyr Leu Glu Ala Gly Leu Ala 

130 135 140 

Lvs Val Arg Lys He He Gin Asn Leu Leu Asn Arg Ala Tyr Lys Arg 
lis 150 155 160 

Leu Asp Leu Glu His Leu Phe Met Asp Tyr Lys Thr Ala Leu Gin Glu 

165 170 175 

Leu Thr Gin Xaa Gin Phe Cys Val He Pro Thr Tyr Gin Leu Leu Gin 

180 185 190 

Glu Lys Gly Pro Asp His His Lys Glu Phe Glu Met Ala Leu Tyr lie 
195 200 205 

PJ „ ft C n T trc M<at- Tw>- ft "! a TV,-r /lis Lys f|1\/ T.ye ,?or Ty^ Gin M.? 

210 215 220 

Glu Gin Gin Cys Ala Tyr Gin Ala Leu Gin Asn Leu Arg Lys Pro Asn 
225 230 235 240 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION 1. . .228 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130 



Leu 


Leu 


Val 


Leu 


Leu 


Asn 


Leu 


Lys 


Xaa 


Thr 


Pro 


Asn 


Leu 


Met 


Trp 


Pro 


1 








5 










10 










15 




Leu 


Asp 


lie 


He 


Val 


Val 


Val 


Ala 


Trp 


Val 


Leu 


Trp Gly 


Val 


Asn 


Met 






20 










25 










30 






Phe 


Gly 


Ser 


Met 


Ser 


Val 


Arg 


Arg 


Glu 


Asn 


Thr 


He 


Tyr 


Val 


Ser 


Leu 




35 










40 










45 






Phe 


Trp 


Tyr 


Tyr 


lie 


Ala 


Thr 


Tyr 


Val 


Gly 


lie 


Ala 


Val 


Met 


Tyr 


lie 


50 










55 










60 








Val 


Asn 


Asn 


Leu 


Ser 


He 


Pro 


Thr 


Tyr 


Phe 


Val 


Ala 


Asp 


Met 


Gly 


Ser 


65 










70 










75 








lie 


80 


Trp 


His 


Xaa 


lie 


Ser 


Met 


Tyr 


Ser 


Gly 


Ser 


Asn 


Asp 


Ala 


Leu 


Gin 








85 










90 










95 




Trp 


Trp 


Trp 


Gly 


His 


Asn 


Ala 


Val 


Ala 


Phe 


Val 


Phe 


Thr 


Ser Gly Val 




100 










105 










110 






lie 


Gly 


Thr 


lie 


Tyr 


Tyr 


Phe 


Leu 


Pro 


Lys 


Glu 


Ser 


Gly Gin 


Pro 


lie 




115 








120 










125 






Val 


Phe 


Ser 


Tyr 


Lys 


Leu 


Thr 


Leu 


Phe 


Ser 


Phe 


Trp 


Ser 


Leu 


Met 


Phe 




130 






135 










140 




Val 






Tyr 


lie 


Trp 


Ala 


Gly 


Gly 


His 


His 


Leu 


lie 


Tyr 


Ser 


Thr 


Xaa 


Asp 


145 






150 










155 










160 


Xaa 


Val 


Gin 


Thr 


Leu 


Ser 


Ser 


Xaa 


Phe 


Ser 


Val 


Val 


Leu 


lie 


Leu 


Pro 








165 










170 










175 




Ser 


Xaa 


Gly 


Thr 


Ala 


lie 


Asn 


Met 


Leu 


Leu 


Xaa 


Met 


Arg 


Gly Gin 


Trp 






180 










185 










190 






His 


Gin 


Xaa 


Lys 


Glu 


Ser 


Pro 


Leu 


He 


Lys 


Phe 


Leu 


Val 


Leu 


Ala 


Ser 






195 








200 










205 


Ala 


lie 




Thr 


Phe 


Tyr 


Met 


Leu 


Ser 


Thr 


Leu 


Glu 


Gly 


Ser 


He 


Gin 


Lys 
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210 215 220 

Ser Val Asn Ala 
225 

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .162 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131 



Met 


Lys 


Ala 


Pro 


Ser 


Gin 


Xaa 


Asp 


Leu 


Lys 


Lys 


He 


Leu Gly 


He 


Glu 


1 








5 










10 










15 




Glu 


Val 


He 


Xaa 


Xaa 


Ser 


Thr 


Ser 


Pro 


Met 


Glu 


Leu 


Arg 


Leu 


Ala 


Asn 








20 










25 








30 






Gin 


Lys 


Leu 


Gly 


Asn 


Arg 


Phe 


He 


Lys 


Thr 


Leu 


Gin 


Ala 


Met 


Asn 


Glu 






35 










40 










45 








Leu 


Asp 


Met 


Gly 


Ala 


Phe 


Phe 


Asn 


Ala 


Tyr 


Ala 


Gin 


Thr 


Thr 


Lys 


Asp 




50 










55 










60 






Pro 


Thr 


His 


Ala 


Thr 


Ser 


Tyr 


Gly 


Val 


Phe 


Ala 


Ala 


Ser 


Leu 


Asn 


Met 


65 










70 










75 










80 


Glu 


Leu 


Lys 


Lys 


Ala 


Leu 


Arg 


His 


Tyr 


Leu 


Tyr 


Ala 


Gin 


Thr 


Ser 


Asn 










85 










90 










95 




Met 


Val 


He 


Asn 


Cys 


Val 


Lys 


Ser 


Val 


Pro 


Leu 


Ser 


Gin 


Asn Asp Gly 








100 










105 










110 






Gin 


Lys 


He 


Leu 


Leu 


Ser 


Leu 


Gin 


Ser 


Pro 


Phe 


Asn 


Gin 


Leu 


He 


Glu 






115 










120 










125 








Lys 


Thr 


Leu 


Glu 


Leu 


Asp 


Glu 


Ser 


His 


Leu 


Cys 


Ala 


Ala 


Ser 


Val 


Gin 




130 










135 










140 










Asn 


Asp 


He 


Lys 


Ala 


Met 


Gin 


His 


Glu 


Ser 


Leu 


Tyr 


Ser 


Arg 


Leu 


Tyr 



145 150 155 160 

Met Ser 



(2) INFORMATION FOR SEQ ID NO: 132: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1. . .59 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132 

Met Ala Phe He Leu Thr Thr Asn Leu Phe He Lys Ser Phe Thr Asn 
15 10 15 
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Ser lie Arg He Thr Gly Cys He lie Ser Pro Asn Val Phe Phe Ala 

Tyr Glu Phe Cys Ala Leu Gly Phe Arg Lys Gly Gly Leu He Leu Asp 

35 40 
Asn Phe Ser Lys Phe Val Ser His Arg Leu Gin 
50 55 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . -248 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133 

Val Arg Phe Phe He Phe Leu He Leu He Cys Pro Leu lie Cys Pro 

i 5 10 x 

Leu Met Ser Ala Asp Ser Ala Leu Pro Ser Val Asn Leu Ser Leu Asn 

Ala Pro Ser Asp Pro Lys Gin Leu Val Thr Thr Leu Asn Val lie Ala 

Leu Leu Thr Leu Leu Val Leu Ma Pro Ser Leu lie Leu Val Met Thr 
50 55 6° 

X 

/ 

Phe Glu Lys Ser Thr Leu Pro Phe Lys Glu Phe Met Leu Lys Asn Thr 



Ser Phe Thr Arg Leu lie Val Val Phe Ser Phe Leu Arg Thr Ala Leu 

Gly Thr Gin Gin Thr llo Pro Thr Gin lie Leu Val Ser Leu Ser Leu 

lie Leu Thr Phe Phe lie Met Glu Pro Ser Leu Lys Lys Ala Tyr Asp 

100 105 
Thr Gly He Lys Pro Tyr Met Asp Lys Lys lie Ser Tyr Thr Glu Ala 
120 " s 



no 135 140 

Arg Glu Lys Asp Leu Ala Leu Phe Phe Arg lie Arg Asn Leu Pro Asn 

Pro Lys Thr Pro Asp Asp Val Ser Leu Ser Val Leu lie Pro Ala Phe 

Met lie Ser Glu Leu Lys Thr Ala Phe Gin He Gly Phe Leu Leu Tyr 

Leu Pro Phe Leu Val lie Asp Met HI He Ser Ser lie Leu Met Ala 

195 200 

Met Gly Met Met Met Leu Pro Pro Val Met lie Ser Leu Pro Phe Lys 

210 215 220 

lie Leu Val Phe lie Leu Val Asp Gly Phe Asn Leu Leu Thr Glu Asn 

225 230 235 *™ 

Leu Val Ala Ser Phe Lys Met Val 
245 

(2) INFORMATION FOR SEQ ID NO:134: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix> FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .166 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:134 

Lys lie Ala Asn Leu 
15 

He Tyr He Val Thr 
30 

Phe Glu lie Leu Glu 
45 

Thr Ser Thr Pro Phe 
60 

Leu Gly Lys Leu Phe 
80 

Tyr Asp Ala Leu Val 
95 

Lys Lys Glu Glu Arg 
110 

Gin Glu Ala lie Lys 
125 

Ala Pro Thr Leu Glu 
140 

Asp Ala Lys Xaa Xaa 
160 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc__f eature 

(B) LOCATION 1 . . .127 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135 

Met Glu ; 
1 



35 

Asp Phe Asr 
50 

Met Phe Met 
65 



Leu 


Leu 


Val 


Thr 


Phe 


Leu 


Asn 


Gly 


Phe 


Asp 


Pro 


1 








5 








10 




Arg 


Lys 


Ala 


Cys 


Asn 


Val 


Tyr 


Ser 


Val 


Gly 


Val 








20 










25 




Thr 


Asn 


Thr 


Leu 


Asn 


He 


Leu 


Ser 


Cys 


Glu 


Ser 






35 










40 






T.yjs 


Arn 


CHu 


Lp u 


Asp 


Thr 


Ser 


Gly 


Val 


Thr 


Lys 




50~ 










55 










Phe 


Ser 


Arg 


Val 


Glu 


Gly 


lie 


Asp 


Ala 


Gly 


Thr 


65 










70 








75 


Ser 


Gly 


Ser 


Gin 


Ser 


Lys 


Asn 


Tyr 


Phe 


Ala 


Tyr 










85 










90 


Lys 


Lys 


Glu 


Lys 


Arg 


Lys 


Glu 


Val 


Arg 


lie 


Glu 


He 






100 










105 






Asp 


Ala 


Arg 


Glu 


Asn 


Lys 


Arg 


Glu 


lie 


Lys 






115 










120 






Glu 


Pro 


Lys 


Lys 


Ala 


Asn 


Gin 


Gly 


Thr 


Glu 


Asn 




130 










135 








Glu 


Lys 


Xaa 


Tyr 


Gin 


Lys 


Ala 


Glu 


Arg 


Lys 


Phe 


145 










150 






155 


Arg 


Arg 


Ser 


Phe 


Lys 


Xaa 




















165 














(2) 


INFORMATION 


FOR 


SEQ 


ID NO:135: 







Ser 


Thr 


Leu 


Tyr 


He 


Val 


He 


Ala 


Gly 


Leu 


Trp 


Leu 


Ala 


Gly 


5 










10 








15 




He 


Phe 


Leu 


Lys 


Lys 


Leu 


Asp 


Met 


Pro 


Val 


He 


lie 


20 










25 










30 






Cys 


Thr 


Gly 


Thr 


Val 


Leu 


Ala 


Ala 


Phe 


Phe 


Lys 


He 


Asn 










40 










45 






Leu 


Leu 


Ser 


Asp 


He 


Gly 


Glu 


Phe 


Gly 


He 


Val 


Phe 


Leu 








55 










60 








He 


Gly 


He 


Glu 


Phe 


Asn 


Phe 


Asp 


Lys 


Leu 


Lys 


Ser 


He 






70 










75 








80 


Val 


Leu 


Val 


Phe 


Gly 


Leu 


Leu 


Gin 


Val 


Val 


Leu 


Cys 


Ala 


Phe 


85 










90 










95 




Leu 


Leu 


Gly 


Tyr 


Phe 


Val 


Leu 


Gly 


Leu 


Ser 


Pro 


lie 


100 










105 










110 




Val 


Leu 


Gly 


Met 


Gly 


Leu 


Ser 


Leu 


Ser 


Ser 


Thr 


Ala 





SUBSTITUTE SHEET (RULE 26) 



WO 97/19098 



PCT/US96/18542 



143 

115 120 125 

(2) INFORMATION FOR SEQ ID NO: 13 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

'A* NAME /KEY : ir.isc feature 
(BJ LOCATION 1. . .16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136 

Leu Leu Leu Met Leu Asn Lys Pro Lys Pro Leu Phe Leu Xaa Leu Gly 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc__feature 

(B) LOCATION 1. . .350 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 



Met 


Ala 


Leu 


Arg 


Val 


Leu 


Leu 


Phe 


Phe 


Cys 


Phe 


Leu 


Phe 


Leu 


Gin 


Ala 


1 






5 










10 










15 




Glu 


Asp 


Lys 


Ser 


Gin 


Glu 


Leu 


Ser 


Ser 


He 


Gin 


Lys 


Gin 


Met 


Ala 


Leu 




20 










25 










30 






Val 


Asp 


Lys 


Lys 


Leu 


Ala 


Lys 


Asp 


Asp 


Asn 


Val 


Trp 


Leu 


Lys 


Lys 


Phe 




35 








40 










45 








Glu 


Asn 


Tyr 


Lys 


He 


Tyr 


Asn 


Gin 


He 


Tyr 


Thr 


Glu 


Lys 


Glu 


Ser 


Val 




50 






55 










60 










Arg 


Gin 


Glu 


Leu 


Arg 


Arg 


Leu 


Lys 


Asn 


Lys 


Lys 


Ser 


Lys 


Asp 


Leu 


Leu 


65 










70 










75 










80 


Lys 


lie 


Ser 


Thr 


Leu 


Glu 


His 


Thr 


Leu 


Lys 


Ala 


Leu 


Glu 


Ser 


Gin 


Gin 








85 










90 










95 




Lys 


Met 


Phe 


Glu 


Ser 


Tyr 


Gly 


Val 


Asn 


Pro 


Phe 


Lys 


Asp 


Leu 


He 


Glu 






100 










105 










110 






Arg 


Pro 


Asn 


He 


Pro 


Asn 


He 


Pro 


Asn 


He 


Ala 


Asn 


Pro 


He 


Ala 


He 




115 










120 










125 


His 






lie 


Asp 


Gly 


He 


Ser 


Phe 


He 


Lys 


Ser 


Met 


Arg 


Leu 


Lys 


Glu 


Asn 




130 








135 










140 










Leu 


Lys 


Asn 


Asn 


Gin 


Thr 


Ser 


Leu 


Gly Glu 


Val 


Leu 


Lys 


Leu 


Leu 


Asp 


145 








150 










155 










160 


Gin 


Lys 


His 


Gin 


Leu 


Leu 


Asn 


Gin 


Trp 


His 


Ala 


Leu 


Asp 


Lys 


Ser 


Ala 








165 










170 










175 




Lys 


Leu 


Ser 


Asp 


Glu 


He 


Tyr 


Gin 


Thr 


Gin 


Ala 


Lys 


Arg 


Leu 


Glu 


Leu 






180 










185 










190 
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Gin Oly Ala Gin Asn Ile Leu , Thr Thr Qly n- phe ^ ^ 

Asp Ser Asp Glu Ala Ile ^ ne val Lys ^ J05 ^ ^ ^ 

Leu Phe Lys Leu Val Tyr Val Phe Leu Ala Ala LeS Leu Ser Val Val 
Phe Ala Trp „. ffi Lys Ile n . Ser £S ^ ^ ^ ^ ^ 

01- Ar S val Tyr Thr Val Asn Lys ^ Asn ^ ^ 255 ^ 



Val lie xaa Xaa lie Xaa Leu Phe Ser Tyr Leu Glu Asn ill Thr Tyr 
Leu Val Thr Val Leu Gly Phe Ala Ser Ala Gly Leu III U . X aa Met 



Lys A SP Leu Phe Met — 01y ^ ^ 300 ne ^ ^ 



Oly Ser Val „ is Val Asp Arg Val £ ^ Lys GJy ^ 320 



333 



He Phe lie Gly Asp Val Leu Asp Thr Ser Asn Val Val His 

345 350 
(2) INFORMATION FOR SEQ ID NO: 13 8: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
(ix) FEATURE: 

(A) NAME /KEY : misc.feature 

(BJ LOCATION 1 ... 99 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 8 
Met Lys Glu Gin Glu Trp Asp Leu Ser Ala Leu Phe Glu Asn Lys Glu 
Ser Ala Glu Glu Phe Leu Lys Thr Leu oL Thr Glu Val Gin otu Phe 
Glu Asn Ala Tyr Gin Asn Asn Leu Lys Asn Leu Asp Ala Al'a Lys Phe 
Ala Asn Thr Leu Lys His Tyr Glu Asn Leu Ser Glu tys lie Ser Arg 
Ala Met Ala Tyr Ala Asn Tyr Phe Leu Pro Arg ?£r Leu Lys Lys Arg 



Ser Phe lie Arg Asn Ala Asn Gly Leu Cys Lys Tyr Pro Thr Thr Iro 
Phe lie Leu 90 95 

(2) INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 78 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
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(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .78 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139 



Leu 


Arg 


Val 


Gly 


Met 


Tyr 


Glu 


Val 


Cys 


Asn 


His 


Lys 


Asp Gly Thr 


Ala 


1 






5 










10 








15 




Tyr 


Kis 


Ser 


Thr 


Arg 


Gly 


Ser 


Lys 


Val 


Thr 


Leu 


Ala 


Cys 


Lys Thr 


Gly 






20 










25 










30 




Thr 


Ala 


Gin 


Val 


Val 


Glu 


He 


Ala 


Gin 


Asn 


He 


Val 


Asn 


Arg Met 


Lys 




35 










40 










45 


He Thr 




Glu 


Lys 


Asp 


Met 


Glu 


Tyr 


Phe 


His 


Xaa 


Ser 


His 


Xaa 


Trp 


Xaa 




50 










55 










60 








Tyr 


Leu 


Xaa 


Pro 


Met 


Lys 


Asn 


Pro 


Asn 


Thr 


Leu 


Ser 


Leu 


Phe 




65 










70 










75 










(2) 


INFORMATION 


FOR 


SEQ 


ID 1 


WO:140: 















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .52 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140 

Leu Gly Leu Val Xaa Gly He Ser Leu Leu His Leu Ser Leu Glu Gin 

1 5 10 15 

Lys lie Ser Val Phe Leu Gly Xaa Asn Leu Met Leu Tyr Pro Val Xaa 

20 25 30 

Glu Val Leu Phe Ser lie Leu Arg Arg Lys lie Lys Arg Gin Lys Ala 

35 40 45 

Thr His Ala Gly 
50 

{2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 77 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1 . . .377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141 

Leu Ala Gin Pro Val Gin Val Arg Thr Val Phe Met Ser Met Thr Leu 
5 10 15 
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Asn 


Ala 


Met 


Gly 


Gin 


?he 


Ala 


Tyr Asn 


Phe 


Pro 


Ala 


Asn 


He 


Ser 


Lys 


Asp 






20 








25 










30 




Lys 


Gin 


Lys 


Leu 


Thr 


Met 


Val Tyr 


Met 


Asp 


Lys 


Asp 


Tyr 


Gly Ala 






35 










40 








45 






Tyr 


Gly Asn 


He 


Val 


Ala 


Met 


Gly Gly Glu Tyr Val 


Lys 


He 


Glu 


Leu 














55 








60 








Gly 


Thr 


Asp 


Thr 


Gly 


Leu 


Asn 


Pro Phe 


Ala 


Trp 


Ala 


Ala 


Cys 


Val 


Gin 


65 










70 








75 








80 


Lys 


Thr 


Asn 


Ala 


Thr 


Met 


Glu 


Gin Lys 


Gin 


Thr 


Ala 


He 


Ser 


Val 


Val 




Glu 






85 








90 










95 




Lys 


Leu 


Val 


Lys 


Asn 


Leu 


Ala Thr 


Lys 


Ser 


Asp 


Glu 


Lys 


Asp Glu 








100 








105 










110 






Asn 


Gly Asn 


Ser 


He 


Ser 


Phe 


Ser Leu 


Ala 


Asp 


Ser 


Asn 


Thr 


Leu 


Ala 






115 










120 






125 






Ala 


Ala 


Val 


Thr 


Asn 


Leu 


lie 


Thr Gly Asp Met Asn 


Leu Asp Tyr 


Pro 




130 










135 








140 










lie 


Thr 


Gin 


Leu 


He 


Asn 


Ala 


Phe Gly Lys Asp His 


Asn Asp 


Pro 


Asn 


145 










150 








155 










i ah 


Gly 


Leu 


Val 


Ala 


Arg 


Leu 


Ala 


Pro Phe 


Cys 


Lys 


Ser 


Thr 


Asn Gly Glu 


Phe 


Gin 






165 








170 










175 




Trp 


Leu 


Phe 


Asp 


Asn 


Lys Ala 


Thr 


Asp 


Arg 


Leu 


Asp 


Phe 


Ser 




Thr 




180 








185 








190 






Lys 


He 


lie 


Gly Val 


Asp 


Gly Ser 


Ser 


Phe 


Leu 


Asp 


Asn 


Asn 


Asp 


Val 




195 










200 








205 






Ser 


Pro 


Phe 


He 


Cys 


Phe 


Tyr Leu 


Phe 


Ala 


Arg 


He 


Gin 


Glu 


Ala 




210 










215 








220 








Met 


Asp Gly Arg Arg 


Phe 


Val 


Leu Asp 


lie 


Asp 


Glu 


Ala 


Trp 


Lys 


Tyr 


225 










230 








235 






240 


Leu 


Gly Asp 


Pro 


Lys 


Val 


Ala 


Tyr Phe 


Val 


Arg 


Asp 


Met 


Leu 


Lys 


Thr 


Ala 








245 








250 










255 




Arg 


Lys 


Arg 
260 


Asn 


Ala 


lie 


Val Arg 
265 


Leu 


Ala 


Thr 


Gin 


Ser 
270 


He 


Thr 


Asp 


Leu 


Leu 


Ala 


Cys 


Pro 


He 


Ala Asp 


Thr 


He 


Arg 


Glu 


Gin 


Cys 


Pro 


Thr 




275 










280 








285 






Lys 


He 


Phe 


Leu 


Arg 


Asn 


Asp Gly Gly Asn 


Leu 


Ser 


Asp 


Tyr 


Gin 




290 










295 








300 






Arg 


Leu 


Ala 


Asn 


Val 


Thr 


Glu 


Lys Glu 


Phe 


Glu 


He 


He 


Thr 


Lys 


Gly 


305 










310 








315 








320 


Leu 


Asp 


Arg 


Lys 


He 


Leu 


Tyr 


Lys Gin 


Asp 


Gly 


Ser 


Pro 


Ser 


Val 


He 


Ala 








325 








330 










335 




Ser 


Phe 


Asn 
340 


Leu 


Arg 


Gly 


He Pro 
345 


Lys 


Glu 


Tyr 


Leu 


Lys 
350 


He 


Leu 


Ser 


Thr 


Asp 


Thr 


Val 


Phe 


Val 


Lys Glu 


He 


Asp 


Lys 


He 


He 


Gin 


Asn 


His 




355 










360 






365 








Ser 


He 


He 


Asp 


Lys 


Tyr 


Gin Pro 

















370 375 



(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .154 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 

Met Lys Leu Val Ser Leu He Val Ala Leu Val Phe Cys Cys Phe Leu 
15 10 15 
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Gly 


Ala 


Val 


Glu 


Leu 


Pro 


Gly 


Val 


Tyr 


Gin 


Thr 






20 










25 




His 


Met 


Lys 


Ser 


Ser 


Phe 


Val 


Glu 


Phe 


Phe 


Glu 




3 5 










40 








Ala 


Tyr 


Gly 


He 


Ser 


Asp 


Val 


Xaa 


Xaa 


Ser 


Lys 




50 








55 










Leu 


Asn 


Pro 


Asn 


Pro 


Lys 


Leu 


Arg 


Asn 


Arg 


Ser 


65 










70 










75 


Phe 


Leu 


Ser 


Asp 


Leu 


I le 


Lys 


val 


uiy 


blU 












85 










90 




Lys 


Ala 


Xaa 


Asn 


Phe 


Xaa 


Asp 


Gly 


Lys 


Thr 


Xaa 






100 










105 






Gin 


Xaa 


Ser 


Asn 


Gly 


Asp 


Leu 


Xaa 


Phe 


Thr 


Ser 






115 










120 








Gly Tyr 


Val 


Gly 


Lys 


Thr 


Phe 


Thr 


Trp 


Lys 


Arg 




130 










135 










He 


Lys 


Asn 


Leu 


Lys 


Leu 


Lys 


Arg 


Phe 


Asn 




145 










150 












(2) 


INFORMATION 


FOR 


SEQ 


ID : 


NO:143: 







147 

ir Gin Glu Phe Leu Tyr 
30 

Asn Gly Lys Phe Tyi 
45 

Ala Lys Lys Asp Lys 
60 

Asp Lys Gly Val Val 
80 

Ser Tyr Lys Gly Gl} 
95 

His Val Arg Val Th3 
110 

Ser Tyr Xaa Lys Tri 
125 

Leu Ser Asp Glu Gli 
140 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: procein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 

Leu Glu Thr Leu Phe Leu Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc.feature 

(B) LOCATION 1 . . .114 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144 

Met Xaa Thr His Asp Arg Arg Lys Leu Arg He Xaa Leu Thr Gin Thr 

15 10 15 

Thr Thr Leu Val Ala Thr He Gly Ser Asn Ala Pro Tyr He Gly Leu 

20 25 30 

Leu Gly Thr Val Met Gly He Met Leu Thr Phe Met Asp Leu Gly Ser 

35 40 45 

Ala Ser Gly He Asp Thr Lys Ala He Met Thr Asn Leu Ala Leu Ala 
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Leu lys Ala Thr Gly ^ » y Leu ^ 60 e ne 



He Tyr Asn Leu Leu Val Arg Lys ^ ^ ^ ^ ^ ^ ^ 

Ue Phe His His Pro v.l A sp Thr G J B Ser Kis cl „ v-1 £ ^ 

Lys Ala 10b 110 

(2) INFORMATION FOR SEQ ID NO: 145; 

<i) SEQUENCE CHARACTERISTICS • 

{ A) LENGTH: 67 amino acids 
(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
Ux) FEATURE: 

(A) NAME /KEY : misc_f eature 

<B) LOCATION 1. . .67 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14 5 
Met Gin Asp Leu Asp Asn Asn Met Ser Leu Asp Thr Ala His Asn Thr 
L eu Ser Ser Asn Gly Lys Asn lie Thr iL Ala Gly Val Val Ala 
Leu Gin Lys lie Gly Val Ser Ala Lys Gly Met Val Ser £. Leu Gln 
Ala Leu Lys Lys Ser Gly Ala lie Ser Ala Lys Trp ll g Tyr Tyr Asp 

Lys Gin Gin 60 
65 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 88 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
(ix) FEATURE: 

(A) NAME /KEY: misc feature 

<B> LOCATION 1. . .88 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:146 
Leu His Pro Leu Ala Asp Val Phe Val ^ Asn Asp Lys Arg ^ ^ 
Leu Ala Met Val Xaa Met Leu U e Xaa JJ, Leu Ala Asn U e £. Phe 
Asn Tyr Leu Phe He Phe Xaa Leu Glu Val Gly Val ai„ lly Xaa Ala 
He Val Thr Val lie Gly His Ala He Gly Gly Leu Va\ Leu Met Gin 
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His Phe Trp Arg Lys Lys Gly Glu Leu Tyr Phe lie Lys Leu lie Phe 
65 70 75 80 

Phe lie Phe Ser His Phe Phe Ser 
85 

(2) INFORMATION FOR SEQ ID NO: 147: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) CUCANISK: Helicobacter pylori 

(ix> FEATURE: 

(A) NAME /KEY: misc_£eature 

(B) LOCATION 1 . . .276 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147 



Met 


Leu 


Arg 


Lys 


Asn 


He 


Leu 


Ala 


Tyr 


Tyr 


Gly Ala 


Asn 


Phe 


Leu 


Leu 


1 






5 










10 










15 




lie 


He 


Ala 


Gin 


Ser 


Leu 


Pro 


His 


Ala 


He 


Leu 


Thr 


Pro 


Leu 


Leu 


Leu 








20 










25 










30 






Ser 


Lys 


Gly 


Leu 


Ser 


Leu 


Ser 


Glu 


He 


Leu 


Leu 


Val 


Gin 


Thr 


Phe 


Phe 




35 










40 










45 








Ser 


Phe 


Cys 


Val 


Leu 


Val 


Ala 


Glu 


Tyr 


Pro 


Ser 


Gly Val 


Leu 


Ala 


Asp 




50 








55 










60 










Leu 


Met 


Ser 


Arg 


Lys 


Asn 


Leu 


Phe 


Leu 


Val 


Ser 


Asn 


Val 


Phe 


Leu 


lie 


65 






70 










75 










80 


Ala 


Ser 


Phe 


Ser 


Leu 


Val 


Leu 


Phe 


Phe 


Asp 


Ser 


Phe 


He 


Leu 


Met. 


Leu 










85 










90 










95 




Leu 


Ala 


Trp 


Gly 


Leu 


Tyr 


Gly 


Leu 


Tyr 


Ser 


Ala 


Cys 


Ser 


Ser 


Gly 


Thr 






100 










105 










110 






He 


Glu 


Ala 


Ser 


Leu 


He 


Thr 


Asp 


He 


Lys 


Glu 


Asn 


Lys 


Lys 


Asp 


Leu 






115 










120 










125 








Ser 


Lys 


Phe 


Leu 


Ala 


Lys 


Asn 


Asn 


Gin 


He 


Thr 


Tyr 


Leu Gly Met 


He 




130 








135 










140 










He 


Gly 


Ser 


Ser 


Leu 


Gly 


Ser 


Phe 


Leu 


Tyr 


Leu 


Lys 


Val 


His 


Ala 


Met 


145 








150 










155 










160 


Leu 


Tyr 


Val 


Val 


Gly 


He 


Phe 


Leu 


He 


Met 


Leu 


Cys 


Ala 


Leu 


Thr 


He 








165 










170 










175 




He 


He 


Tyr 


Phe 


Lys 


Glu 


Lys 


Glu 


Gly 


Asp 


Phe 


Lys 


Ser 


Gin 


Lys 


Asn 






180 








185 










190 






Leu 


Lys 


Leu 


Leu 


Lys 


Glu 


Gin 


Val 


Lys 


Gly 


Ser 


Leu 


Lys 


Glu 


Leu 


Lys 






195 








200 










205 








Asp 


Asn 


Pro 


Lys 


Leu 


Lys 


.He 


Leu 


Leu 


Val 


Gly 


His 


Leu 


He 


Thr 


Pro 


210 








215 










220 










Val 


Phe 


Phe 


Met 


Ser 


His 


Phe 


Gin 


Met 


Trp 


Gin 


Ala 


Tyr 


Phe 


Leu 


Lys 


225 










230 










235 










240 


Gin 


Gly 


Val 


Lys 


Glu 


Gin 


Tyr 


Leu 


Phe 


Val 


Phe 


Tyr 


lie 


Ala 


Phe 


Gin 






245 










250 










255 




Val 


lie 


Ser 


He 


Pro 


His 


Ser 


Phe 


Phe 


Lys 


Ser 


Gin 


Lys 


Leu 


Xaa 


Ala 








260 










265 










270 






Lys 


Lys 


Ser 


Pro 






























275 




























(2) 


INFORMATION 


FOR 


SEQ 


ID NO:148: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc.feature 

(B) LOCATION 1. . .93 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 8 

Leu Tyr Pro Pro Gly Ser Val Val Lys Met Gly Val Gly Leu Ser Phe 

5 10 15 

Leu Glu Asn Leu His He Thr Glu Asn Thr Thr He Pro Thr Pro Pro 

20 25 30 

Pne lie Giu Val Giy Lys Arg Lys Phe Arg Asp Trp Lys Lys Thr Gly 

35 40 45 

His Gly Asn Ser Asn Leu Tyr Lys Ala He Arg Glu Ser Val Asp Val 

50 55 go 

Tyr Phe Tyr Lys Phe Gly Leu Glu lie Ser lie Glu Xaa Leu Ser Lys 

70 75 8 1 

Xaa Phe Lys Xaa Ser Gly Leu Trp Gly Lys Asn Gly Arg 
85 go 

(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc__f eature 

(B) LOCATION 1 . . .60 

(xi) SEQUENCE DESCRIPTION .* SEQ ID NO: 149 

Met Ala His His Xaa Glu Gin His Gly Gly His His His His His His 

■ 5 10 15 

His Thr His His His His Tyr His Gly Gly Glu His His His His His 

20 25 30 

His Ser Ser His His Glu Glu Gly Cys Cys Ser Thr Ser Asp Ser His 

35 40 45 

His Gin Glu Glu Gly Cys Cys His Gly Xaa His Glu 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
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(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .297 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150 



Leu Val Lys He Arg Leu Phe Asp Phe Thr He Arg Leu Phe Lys Pro 

15 10 15 

Glu Phe His He Phe Asp Phe Leu Lys Gly He Arg Val Leu Met He 

20 25 30 

Glu Trp Met Gin Asn His Arg Lys Tyr Leu Val Val Thr He Trp lie 

35 40 45 

Ser Thr He Ala Phe He Ala Ala Gly Met He Gly Trp Gly Gin Tyr 

50 55 60 

Ser Phe Ser Leu Asp Ser Asp Ser Ala Ala Lys Val Gly Gin lie Lys 
65 70 75 80 

lie Ser Gin Glu Glu Leu Ala Gin Glu Tyr Arg Arg Leu Lys Asp Ala 

n r r.n r\r 

u ^ J v/ J -J 

Tyr Ala Glu Ser lie Pro Asp Phe Lys Glu Leu Thr Glu Asp Gin lie 

100 105 110 

Lys Ala Met His Leu Glu Lys Ser Ala Leu Asp Ser Leu lie Asn Gin 

115 120 125 

Ala Leu Leu Arg Asn Phe Ala Leu Asp Leu Gly Leu Gly Ala Thr Lys 

130 135 140 

Gin Glu Val Ala Lys Glu He Arg Lys Thr Asn Val Phe Gin Lys Asp 
145 150 155 160 

Gly Val Phe Asp Glu Glu Leu Tyr Lys Asn lie Leu Lys Gin Ser His 

165 170 175 

Tyr Arg Pro Lys His Phe Glu Glu Ser Val Glu Arg Leu Leu lie Leu 

180 185 190 

Gin Lys He Ser Ala Leu Phe Pro Lys Thr Thr Thr Pro Leu Glu Gin 

195 200 205 

Ser Ser Leu Ser Leu Trp Ala Lys Leu Gin Asp Lys Leu Asp He Leu 

210 215 220 

lie Leu Asn Pro Asn Asp Val Lys lie Ser Leu Asn Glu Glu Glu Met 
225 230 235 240 

Lys Lys Tyr Tyr Glu Asn His Arg Lys Asp Phe Lys Lys Pro Thr Ser 

245 250 255 

Phe Lys Thr Arg Ser Leu Tyr Phe Asp Ala Ser Leu Glu Lys Thr Asp 

260 265 270 

Leu Lys Glu Leu Glu Glu Tyr Tyr His Lys Asn Lys Val Ser Tyr Leu 

275 280 285 

Asp Xaa Xaa Gly Glu lie Thr Gly Phe 
290 295 



(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1. . .90 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151 

Met Val Lys His Tyr Leu Phe Met Ala Val Ser Gin Val Phe Phe Ser 
15 10 15 

Phe Phe Leu Val Leu Phe Phe He Ser Ser lie Val Leu Leu lie Ser 
20 25 30 
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He 


Ala 


Ser 


Val 


Thr 


Leu 


Val 


He Lys Val 


Ser 


Phe 


Leu 






35 










40 






45 


Gin 


Leu 


Phe 


Leu 


Tyr 


Ser 


Leu 


Pro Gly Thr 


He 


Phe 


Phe 




50 










55 




60 




He 


Thr 


Phe 


Phe 


Ala 


Ala 


Xaa 


Arg Leu Gly 


Xaa 


Ser 


Arg 


65 










70 




75 




Asp 


His 


Glu 


Leu 


Leu 


Val 


Phe 


Phe Leu Xaa 
















85 






90 








(2) 


INFORMATION 


FOR 


SEQ 


ID NO;152: 









80 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix J FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .86 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152 



Met 


Ser 


Lys 


Arg 


Ala 


He 


Arg 


Phe 


Pro 


Asn 


Lys 


Leu 


Phe 


Ser 


Tyr 


Pro 


1 








5 










10 








15 




Lys 


Pro 


Lys 


He 


Lys 


Ala 


Thr 


Asn 


Thr 


Ser 


His 


Thr 


Val 


Leu 


Phe 


Ala 








20 










25 










30 






Tyr 


Pro 


Leu 


Lys 


Pro 


His 


Glu 


Met 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Thr 


Ser 






35 










40 










45 








Leu 


Leu 


Ala 


Pro 


He 


Phe 


Asn 


Ala 


He 


His 


Ser 


Thr 


Asn 


Ala 


Leu 


Asn 


Ala 


50 










55 










60 










He 


Lys 


Pro 


Asp 


Gly 


Thr 


Gly 


Ser 


Lys 


He 


Asn 


Pro 


lie 


He 


Met 


65 










70 








75 










80 


Pro 


Met 


Lys 


He 


Gin 


Lys 























85 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .141 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 

Val Tyr Ser Arg Phe Phe Ala Asn Gin His Glu Phe Asp Phe Glu Ala 

1 5 10 15 

Gin Gly Ala Leu Gly Ser Asp Gin Ser Ser Leu Asn Phe Lys Ser Thr 

20 25 30 

Leu Leu Gin Asp Leu Asn Gin Ser Tyr Asn Tyr Leu Ala Tyr Ser Ala 

35 40 45 

Thr Ala Arg Ala Ser Tyr Gly Tyr Asp Phe Ala Phe Phe Arg Asn Ala 
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153 

50 55 60 



Leu 


Val 


Leu 


Lys 


Pro 


Ser 


Val 


Gly 


Val 


Ser 


Tyr 


Asn 


His 


Leu 


Gly 


Ser 


65 










70 










75 










80 


Thr 


Asn 


Phe 


Lys 


Ser 


Asn 


Ser 


Gin 


Ser 


Gin 


Val 


Ala 


Leu 


Lys 


Asn 


Gly 








85 










90 










95 




Ala 


Ser 


Ser 


Gin 


His 


Leu 


Phe 


Asn 


Ala 


Asn 


Ala 


Thr 


Trp 


Lys 


Arg 


Val 








10O 










105 










110 






He 


He 


Met 


Gly 


Thr 


Leu 


His 


Thr 


Phe 


He 


Cys 


Met 


Trp 


Giu 


Phe 


Tyr 






115 








120 










125 








Lys 


Ser 


Ser 


Leu 


Thr 


Leu 


Asp 


Arg 


Met 


Met 


Trp 


Arg 


Leu 








130 










135 










140 











(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

/ rv» rn/-> p/-\r r\r*\r . t * — — *■ 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

<ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION 1. . .185 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154 



Met 


Xaa Glu 


Asn 


Gly 


Arg 


Gly 


Val 


Pro 


Lys 


Asp 


Tyr 


Lys 


Lys 


Ala 


Val 


1 






5 










10 










15 




Glu 


Tyr Phe 


Gin 


Lys 


Ala 


Val 


Asp 


Asn 


Asp 


He 


Pro 


Arg Gly 


Tyr 


Asn 






20 








25 










30 






Asn 


Leu Gly 


Val 


Met 


Tyr 


Lys 


Glu 


Gly 


Lys 


Gly 


Val 


Pro 


Lys 


Asp 


Glu 




35 










40 










45 








Lys 


Lys Ala 


Val 


Glu 


Tyr 


Phe 


Arg 


lie 


Ala 


Thr 


Glu 


Lys 


Gly Tyr 


Thr 




50 








55 










60 










Asn 


Ala Tyr 


He 


Asn 


Leu 


Gly 


He 


Met 


Tyr 


Met 


Glu 


Gly Arg 


Gly 


Val 


65 






70 








75 










80 


Pro 


Ser Asn 


Tyr 


Ala 


Lys 


Ala 


Thr 


Giu 


Cys 


Phe 


Arg 


Lys 


Ala 


Met 


His 








85 








90 










95 




Lys 


Gly Asn 


Val 


Xaa 


Ala 


Tyr 


He 


Leu 


Leu 


Gly 


Asp 


He 


Tyr 


Tyr 


Ser 






100 










105 










110 






Gly 


Met He 


Asn 


Trp 


Val 


Leu 


Ser 


Arg 


Thr 


Lys 


He 


Arg 


Leu 


Val 


His 


115 










120 










125 








Tyr 


Lys Met 


Ala 


Ala 


Asp 


Val 


Ser 


Ser 


Ser 


Arg 


Ala 


Tyr 


Xaa 


Gly 


Leu 




130 








135 










140 










Ser 


Glu Ser 


Tyr 


Xaa 


Tyr 


Gly 


Leu 


Gly 


Val 


Glu 


Lys 


Xaa 


Xaa 


Lys 


Lys 


145 






150 










155 










160 


Ala 


Glu Glu 


Tyr 


Met 


Gin 


Lys 


Ala 


Cys 


Asp 


Phe 


Asp 


He 


Asp 


Lys 


Asn 








165 










170 










175 




Cys 


Lys Lys 


Lys 


Asn 


Thr 


Ser 


Ser 


Arg 




















180 










185 
















(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 155: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY ; misc_feature 

(B) LOCATION 1. . .139 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155 



Leu 


Leu 


Asn 


Met 


Trp 


Asp 


Glu 


Ala 


Lys 


Lys Glu Gly 


lie 


Asn 


He 


Asn 


1 


Glu 






5 










10 










15 




Thr 


Lys 


Leu 


Ser 


Gin 


Glu 


Leu 


Gly 


Va] 


Val 


Cys 


Val 


Pro 


Thr 


Ser 








20 










25 










30 




Ala Arg 


Xaa 


Lys 


Glu 


Asp 


Arg 


Leu 


Asn 


Thr 


Glu 


Leu 


Leu 


Leu 


Asp 


Glu 


He 




35 










40 










45 




Val 
50 


Arg 


Leu 


Tyr 


Ser 


Gin 
55 


Asn 


Thr 


Thr 


Asn 


Asn 
60 


Glu 


Asn 


He 


Lys 


Val 


Pro 


Ser 


Gin 


Ser 


Phe 


Lys 


Glu 


Ser 


Leu 


Lys 


Tyr 


Ser 


Gin 


Ser 


Ala 


6 5 










70 










75 








20 


Gin 


Arg 


He 


Ala 


Lys 


Ser 


Val 


He 


Ser 


Glu 


Asn 


Lys 


Gin 


Asn 


Ala 


Ser 


Phe 


Glu 


His 




85 










90 








95 


Thr 


Tyr 


Lys 


He 


Asp 


Lys 


He 


Phe 


Asn 


Ala 


Pro 


Ala 


Leu 








100 










105 










110 




Trp 


Asp 


Phe 


His 


Phe 


Phe 


Xaa 


Val 


Tyr 


Val 


Tyr 


His 


Leu 


Phe 


Phe 


Glu 




Phe 


115 










120 








125 




Leu 


Asn 


Arg 


Arg 


Gly 


Ser 


Ala 


Lys 


Ser 


Pro 












130 










135 



















(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1. . .193 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 

Met Gin C 
1 



35 

He Thr He 
50 

Leu Val lie 
65 



115 

Phe Leu Trp 

130 
Gly Ser Val 
145 



Ala 


Leu 


Leu 


Arg 


Phe 


Gin Glu 


Gly 


Phe 


Lys 


Glu 


Trp Gly 




5 








10 










15 




Leu 


Phe 


Leu 


Tyr 


Ser 


Leu Gly 


Gly 


Gly Tyr Val Gly 


He 


20 


He 








25 








30 






Ser 


Leu 


Ser 


Ala 


Thr Thr 


His 


Ala 


Leu Asp 


He 


Lys 










40 








45 






Leu 


Val 


Ala 


Phe 


Leu 


Gly Asn 


Leu 


He 


Gly 


Ser Gly Ala 


Phe 






55 








60 








Ala 


Arg 


Tyr 


Gin 


Lys Arg 


Glu 


Phe 


Leu 


Lys 


Tyr 


Phe 






70 








75 






80 


Arg 


Arg 


Lys 


Leu 


Ala 


Leu Ala 


Ser 


Leu 


Trp 


Val 


Lys 


Arg 




85 








90 










95 


Leu 


Met 


He 


Phe 


Val 


Asn Lys 


Tyr 


Leu 


Tyr 


Gly 


He 


Lys 


100 










105 








110 




Pro 


Leu 


Ala 


He 


Gly 


Phe Ser 


Lys 


Tyr 


Pro 


Leu 


Lys 


Lys 










120 








125 




Leu 


Asn 


Val 


Phe 
135 


Ser 


Ser Phe 


Leu 


Trp 
140 


Ala 


Leu 


He 


Val 


Ser 


Phe 


Gin 


Ala 


Ser 


Asp Trp 


Val 


Lys 


Thr 


Leu 


Tyr 


Glu 






150 








155 








160 


His 


Tyr 


Thr 


Ser 


Phe 


Phe Val 


He 


Ser 


Phe 


Val 


Leu 


He 




165 








170 










175 
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155 



Ala Leu Leu lie Trp Phe Leu Leu Lys Arg Tyr Ser Arg Lys Met Gly 
180 185 190 

Phe 



(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

< i i i ) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(AS ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .129 



(xil SEQUENCE DESCRIPTION: SEQ ID NO: 157 



Met 


Arg 


Lys 


Gly 


Arg 


Val 


Met 


Leu 


Cys 


Val 


Phe 


Asp 


He 


Glu 


Thr 


He 


1 








5 










10 










15 




Pro 


Asn 


lie 


Ser 


Leu 


Cys 


Lys 


Glu 


His 


Phe 


Gin 


Leu 


Lys 


Glu 


Asp 


Asp 








20 










25 










30 




Ala 


Leu 


Lys 
35 


He 


Cys 


Glu 


Trp 


Ser 
40 


Phe 


Glu 


Lys 


Gin 


Lys 
45 


Glu 


Lys 


Ser 


Gly 


Ser 


Glu 


Phe 


Leu 


Pro 


Leu 


Tyr 


Leu 


His 


Glu 


He 


He 


Ser 


He 


Ala 




50 










55 








60 










Ala 


Val 


He 


Gly 


Asp 


Asp 


Tyr 


Gly 


Gin 


Phe 


He 


Lys 


Val 


Gly 


Asn 


Phe 


65 










70 










75 










80 


Gly Gin 


Lys 


His 


Glu 


Asn 


Lys 


Glu 


Asp 


Phe 


Ala 


Ser 


Glu 


Lys 


Glu 


Leu 










85 










90 










95 




Leu 


Glu 


Asp 


Phe 
100 


Phe 


Lys 


Tyr 


Phe 


Asn 
105 


Glu 


Lys 


Gin 


Pro 


Arg 
110 


Leu 


He 


Ser 


Phe 


Xaa 


Gly 


Arg 


Gly 


Phe 


Gly 


Tyr 


Ser 


Pro 


Thr 


His 


Ala 


Gin 


Ser 



115 120 125 



Pro 



(2) INFORMATION FOR SEQ ID NO:158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 307 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f ea ture 

(B) LOCATION 1. . .307 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 

Met Ala Lys Lys Lys lie Ala He Ser Cys Gly Asp lie Gin Gly Val 

15 10 15 

Gly Leu Glu Leu He Leu Lys Ser His Lys Glu Val Ser Ala Leu Cys 

20 25 30 

Glu Pro Leu Tyr Leu Val His Ser Glu Leu Leu Glu Arg Aia Asn Gin 
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156 







35 










40 










45 








Leu 


Leu 


Asp 


Asn 


Ala 


Tyr 


Glu 


Thr 


Lys 


Thr 


Leu 


Asn 


Ala 


He 


Ala 


He 




50 










55 










60 








Asp 


Ala 


Pro 


Leu 


Pro 


Leu 


Leu 


Asn 


Ser 


Ser 


Thr 


He Gly Lys 


Val 


Ser 


65 


Gin 








70 










75 










80 


Thr 


Ser 


Gly Ala 


Tyr 


Ser 


Phe 


Glu 


Ser 


Phe 


Lys 


Lys 


Ala 


Cys 


Glu 


Leu 


Ala 






85 










90 








95 




Asp 


Ser 


Lys 


Glu 


Val 


Asp Gly 


He 


Cys 


Thr 


Leu 


Pro 


He 


Asn 






Ala 


100 










105 










110 






Lys 


Leu 


Trp 


Gin 


Gin 


Ala 


Gin 


He 


Pro 


Phe 


Val 


Gly His 


Thr 


Asp 


Phe 




115 










120 










125 






Leu 


Lys 


Gin 


Arg 


Tyr 


Lys 


Asp 


His 


Gin 


He 


He 


Met 


Met 


Leu 


Gly 




130 










135 










140 








Cys 


Ser 


Lys 


Leu 


Phe 


Val 


Gly 


j-»eu 


rflG 


Ser 


Asp 


His 


Val 


Pro 


Leu 


Ser 


145 


Val 








150 










155 










160 


Ala 


Ser 


Gin 


Leu 


He 


Gin 


Val 


Lys 


Ala 


Leu 


Val 


Lys 


Phe 


Leu 


Leu 


Ala 


Phe 






165 










170 








175 




Glu 


Ly » 


Sei 


Tlii. 


Glii 


Ala 


Lys 


I It: 


Val 


Gin 


Val 


Cys 


Gly 


Fiie 








180 










185 










190 




Asn 


Pro 


His 


Ala Gly Glu Glu 


Gly 


Leu 


Phe 


Gly Glu Glu Asp Glu Lys 


lie 




195 










200 










205 








Leu 
210 


Lys 


Ala 


He 


Gin 


Glu 
215 


Ser 


Asn 


Gin 


Thr 


Leu Gly 
220 


Phe 


Glu 


Cys 


Phe 


Leu 


Gly 


Pro 


Leu 


Pro 


Ala 


Asp 


Ser 


Ala 


Phe 


Ala 


Pro 


Asn 


Lys 


Arg 


225 


He 








230 










235 








240 


Lys 


Thr 


Pro 


Phe 


Tyr 


Val 


Ser 


Met 


Ser 


His 


Asp Val 


Gly Leu 


Ala 










245 










250 










255 




Pro 


Leu 


Lys 


Ala 


Leu 


Tyr 


Phe 


Asp 


Glu 


Ser 


He 


Asn 


Val 


Ser 


Leu 


Asn 


Ala 






260 










265 










270 






Pro 


He 


Leu 


Arg 


Ala 


Ser 


Thr 


Asp 


His 


Gly 


Thr 


Ala 


Phe 


Asp 


He 


Ala 




275 










280 










285 






Tyr 


Gin 


Asn 


Lys 


Ala 


Asn 


His 


Lys 


Ser 


Tyr 


Leu 


Asn 


Ala 


He 


Lys 




290 










295 










300 








Tyr 


Leu 


Ala 





























305 



(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 
(BJ TYPE: amino acid 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1. . .146 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159 



Met 


Ser 


Ser Gly 


Leu 


He 


Tyr 


He 


Ser 


Leu 


Glu 


Val 


Leu 


Val 


Xaa 


Cys 


1 


He 


Thr Ala 


5 










10 










15 


Leu 


Leu 


He 


Met 


Tyr 


Tyr 


Val 


Met 


Lys 


Lys 


He 


Tyr 


Tyr 


Ala 




20 










25 










30 


Arg Gly Gin 


Ala 


He 


Leu 


Lys 


Gly Ala 


Ser 


Ala 


Lys 


Ala 


Lys 


Leu 














40 










45 






Met 


Glu 
50 


Phe Gin 


Ala 


Lys 


Ser 
55 


Phe 


Val 


Glu 


Ala 


Glu 
60 


Glu 


Met 


Arg 


Met 


Lys 


Ser 


Gin Glu 


Cys 


Lys 


Leu 


Gin 


Gin 


Gin 


Tyr 


Glu 


Asn 


Lys 


Asn 


Leu 


65 








70 










75 








80 


Gin 


Leu 


Gin Thr 


His 


Phe 


Asp 


Lys 


Lys 


Glu 


Ala 


His 


Leu 


Lys 


His 


Leu 


Glu 


Ala 




85 










90 








95 




Gin His 


Lys 


Glu 


Phe 


Val 


Arg 


Asp 


Glu 


Lys 


Arg 


Tyr 


Leu 


Glu 
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100 

Lys Glu Lvs Lys 
115 

Arg Glu Asn Phe 

130 
Lys Ala 
145 



Glu Leu Glu Lys 
120 

Xaa Xaa Gin Arg 
135 



157 

105 

Glu Arg Gin He 

Ala Phe Val Xaa 
140 



110 

Leu Glu Xaa Glu 
125 

Xaa Xaa Xaa Ala 



(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( iii) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) .FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .127 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 



Met 


Asn 


He 


Lys 


He 


Leu 


Lys 


He 


Leu Val Gly Gly 


Leu 


Phe 


Phe 


Leu 


i 








5 








10 








15 




Ser 


Leu 


Asn 


Ala 


His 


Leu 


Trp 


Gly 


Lys Gin Asp 


Asn 


Ser 


Phe 


Leu 


Gly 








20 










25 






30 






He 


Gly 


Glu 


Arg 


Ala 


Tyr 


Lys 


Ser 


Gly Asn Tyr 


Ser 


Lys 


Ala 


Ala 


Ser 






35 










40 






45 








Tyr 


Phe 


Lys 


Lys 


Ala 


Cys 


Asn 


Asp 


Gly Val Ser 


Glu 


Gly 


Cys 


Thr 


Gin 




50 










55 






60 










Leu 


Gly 


He 


He 


Tyr 


Glu 


Asn 


Gly 


Gin Gly Thr 


Arg 


He 


Asp 


Tyr 


Lys 


65 










70 






75 










80 


Lys 


Ala 


Leu 


Glu 


Tyr 


Tyr 


Lys 


Thr 


Ala Cys Gin 


Ala 


Asp 


Asp 


Arg 


Glu 










85 








90 








95 




Gly Cys 


Phe 


Gly 


Leu 


Gly 


Gly 


Leu 


Tyr Asp Glu 


Gly 


Leu Gly Thr 


Ala 








100 










105 






110 






Gin 


Asn 


Tyr 


Gin 


Glu 


Ala 


Xaa 


Asp 


Ala Tyr Ala 


Arg 


His 


Ala 


Phe 





115 120 125 



(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc.feature 

(B) LOCATION 1 . . .116 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161 



Met Ala He Ala lie Lys Asp Leu Leu Ser Ala Tyr Lys Val Val Leu 

15 10 15 

Pro Leu Asp Lys He Ser Met Pro Ser Ser Ala Asp Leu Lys Leu Thr 
20 25 30 



SUBSTITUTE SHEET (RULE 26) 



WO 97/19098 



PCTAJS96/18542 



158 

Leu Gin Phe Leu Lys Asn Thr Ala Pro Leu Phe Ser Val Gin Gly Ser 

Val Asn Leu Gin Glu Gly Thr IL Ser Leu Tyr Asn iL Pro Leu Tyr 

Thr Gin Ser Ala Gin lie Asn Leu Asp lie Ala Gin Glu Tyr Gin Tyr 

He Tyr He Asp Thr lie His Thr Arg Tyr Ala Asn Met Xaa Asp Leu 

Asp Ala Lys lie Ala Leu Asp Leu Gly IL Lys Asn Leu Ser Xaa Xaa 

x uu 105 i 1 n 

Xaa Leu Gly Pro 110 

115 

(2) INFORMATION FOR SEQ ID NO: 162: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 
!B) TYPE: a.-nino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

fvi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

<ix) FEATURE: 

(A) NAME /KEY : misc_feature 
<B) LOCATION 1 ... 82 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162 



15 

Cys Ala Phe He Leu Il< 
30 

Val Leu Phe Arg Leu Ast 
45 

Phe He Xaa Cys Tyr Le\ 
60 

Cys Ala Leu Glu Lys Arc 
75 80* 



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc„feature 

(B) LOCATION 1. . .116 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163 
Leu Lys Val Thr Asn Pro His Leu Leu Val Val lie Gin Asp Leu Asn 
Ala Arg lie Ala Leu Met Lys Leu Leu IL Gin Asn Val Lys Ser Ala 



Met 


Pro 


Asp 


Asn 


Leu 


His 


Leu 


His 


Thr 


Leu 


1 

Gin 


Arg 


Ser 


Phe 


5 

Asn 


Tyr 


Pro 


Asn 


Pro 


10 
Leu 


Leu 






20 










25 


Cys 


Asn 


Leu 


Pro 


Phe 


He 


Leu 


lie 


Ser 


Ala 




35 










40 




Tyr 


Ala 


Leu 


He 


Val 


He 


Ser 


Leu 


Val 


He 
65 


50 










55 




Gly 


Xaa 


Ala 


Tyr 


Leu 
70 


Asn 


Arg 


Gin 


Val 


Ala 


Phe 
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159 



50 

Ser Va] 
65 



His Phe Cys Ala 
115 





20 










25 








30 




ber 


Glu 


Leu 


Val 


Phe 


Cys 


Asn 


Lys 


Glu 


Lys 


Arg 


Leu He 


Arg 


35 








40 










45 






Ala 


Gin 


Lys 


Glu 


Tyr 


Gly 


He 


Thr 


Pro 


Val 


Glu Asn 


He 


Leu 








55 










60 




He 




Leu 


Asp 


Thr 


Ala 


Met 


Asn 


Pro 


Asn 


Ser 


Ala 


Leu Val 


Asp 






70 










75 








80 


Asn 


Glu 


Ala 


Lys 


Glu 


Leu 


His 


Asp 


Lys 


Val 


Gly Ala 


Glu 


Lys 






85 








90 








95 




Ser 


Phe 


Leu 


Glu 


Lys 


Ala 


Xaa 


Arg 


Gin 


Arg 


Ala Val 


Leu 


Arg 




100 








105 








110 







(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 a it. in 3 acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .198 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 

:ys Thr Ala Cys Ser 
15 

Val Lys Lys He 
30 

Phe Asn Glu Asn 
45 

Lys Leu Gly Tyr 

Phe Phe Ser Pro 
80 

Phe Val Val Tyr 
95 

Pro Glu Ser Leu 
110 

Cys Leu Gin Leu 
125 

Phe Tyr He Gin 

Ser Ser Leu He 
160 

Gly Asn Cys He 
175 

Met Gly He He 
190 



Met 


Lys 


Glu 


Ser 


Phe 


Tyr 


He 


Glu 


Gly 


Met 


Thr 


Cys 


1 






5 










10 






Ser 


Gly 


He 


Glu 


Arg 


Ser 


Leu 


Gly Arg 


Lys 


Ser 


Phe 






20 










25 








Glu 


Val 


Ser 


Leu 


Leu 


Asn 


Lys 


Ser 


Ala 


Asn 


lie 


Glu 






35 










40 








Glu 


Glu 


Thr 


Asn 


Leu 


Asp 


Glu 


lie 


Phe 


Lys 


Leu 


lie 




50 








55 










60 


Ser 


Pro 


Lys 


Lys 


Thr 


Leu 


Ala 


Glu 


Glu 


Lys 


Lys 


Glu 


65 








70 










75 




Asn 


Val 


Lys 


Leu 


Ala 


Leu 


Ala 


Val 


lie 


Phe 


Thr 


Leu 








85 










90 






Leu 


Ser 


Met 


Gly 
100 


Ala 


Met 


Leu 


Ser 


Pro 
105 


Ser 


Leu 


Leu 


Leu 


Thr 


lie 
115 


Asn 


His 


His 


Ser 


Asn 
120 


Phe 


Leu 


Asn 


Ala 


He 


Gly 
130 


Ala 


Leu 


He 


Val 


Met 
135 


His 


Leu 


Gly 


Arg 


Asp 
140 


Gly 


Phe 


Lys 


Ala 


Leu 


Trp 


His 


Arg 


Gin 


Pro 


Asn 


Met 


145 










150 










155 




Ala 


He 


Gly 


Thr 


Ser 


Ala 


Ala 


Leu 


He 


Ser 


Ala 


Cys 








165 










170 






Trp 


Phe 


lie 


Pro 


lie 


He 


lie 


Pro 


lie 


Ser 


Gly 


Leu 






180 










185 








He 


Leu 


Lys 
195 


Ala 


Cys 


Ala 















(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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Val 


Pro 


Asp 




5 




Val 


Gly 


Asp 


20 




Thr 


Pro 


Leu 


Leu 


Ala 


Thr 


Pro 


Leu 


Asn 






70 


His 


Val 






85 




ION 


FOR 


SEQ 



160 

(iii} HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .85 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 165 

n Leu Trp Lys Leu Lys Arg Phe Asn Gli 

10 is 
r Leu lie Thr Ala He Gly Gin Gly Sej 

25 30 
i Val Leu Ala Tyr Thr Gly Leu He Ale 
40 45 

50 - " - -° His Phe A la He His Asn Gin Gin Pre 

i Lys Asp Pro Leu Asn Arg Phe Ser Lys Lys Glu Ala Pro Ser Let 
b 70 75 eo 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 43 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc.feature 

(B) LOCATION 1. . .343 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166 

Met Gin Asn Leu Leu lie Gin Ala Glu Asn Ala He Ala Leu Leu Phe 

Leu Leu Asn Asp Lys Asn Leu Lys Gly lys He Asp Leu He ^r He 

Asp Pro Pro Phe Ala Thr Asn Asn Its Phe Thr He Thr lln Gly Arg 

Ala Thr Thr He Ser Asn Ser Lys Asn Gly Asp He ila Tyr Ser Asp 

55 go 
Lys Val Val Gly Met Asp Phe Met Glu Phe Leu Lys Gin Arg Leu Val 

70 75 
Leu Leu Lys Glu Leu Leu Ser Glu Gin Gly Ser lie Tyr Val His Thr 



Asp Tyr Lys lie Gly His Tyr Val Lys Val Met Leu Asp Glu lie Phe 

00 105 no 

Gly lie Gin Asn Phe Arg Asn Glu He Thr Arg lie Lys Cys Asn Pro 

Lys Asn Phe Lys Arg lie Gly Tyr Gly Asn He Lys III Met lie Leu 

135 140 

Phe Tyr Ser Lys Gly Lys Asn Pro lie Phe Asn Glu Pro Lys lie Pro 

150 155 un 

Tyr Thr Pro Gin Asp Leu Glu Lys Arg Phe Pro Lys lie Asp Lys III 

Lys Arg Arg Tyr Thr Thr Val Pro lie Jis Ala Pro Gly Glu lit Glu 

180 185 190 
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161 



Ser 


Gly 


Glu 
195 


Cys 


Ser 


Lys 


Ala 


Phe 
200 


Lys 


Gly 


Met 


Leu 


Pro 
205 


Pro 


Lys 


Gly 


Arg 


His 


Trp 


Arg 


Thr 


Asp 


lie 


Ala 


Thr 


Leu 


Glu 


Arg 


Trp Asp 


Lys 


Glu 




210 










215 










220 










Gly 


Leu 


lie 


Glu 


Tyr 


Ser 


Asn 


Asn 


Asn 


Asn 


Pro 


Arg 


Lys 


Lys 


He 


Tyr 


225 










230 










235 










240 


Ala 


Leu 


Glu 


Gin 


Val 
245 


Gly 


Lys 


Arg 


Val 


Gin 
250 


Asp 


He 


Trp 


Glu 


Phe 
255 


Lys 


Asp 


Pro 


Gin 


Tyr 
260 


Pro 


Ser 


Tyr 


Pro 


Thr 
265 


Glu 


Lys 


Asn 


Ala 


Gin 
270 


Leu 


Leu 


Asp 


Leu 


He 
275 


He 


Lys 


Thr 


Ser 


Ser 
280 


Asn 


Lys 


Asp 


Ser 


lie 
285 


Val 


Leu 


Asp 


Cys 


Phe 
290 


Cys 


Gly 


Ser 


Gly 


Thr 
295 


Thr 


Leu 


Lys 


Ser 


Ala 
300 


Phe 


Leu 


Leu 


Gin 


Arg 


Lys 


Phe 


He 


Gly 


He 


Asp 


Asn 


Ser 


Asp 


Leu 


Ala 


He 


Gin 


Ala 


Cys 


305 










310 










315 










320 


Lys 


Asn 


Lys 


Leu 


Glu 
325 


Thr 


He 


Thr 


Lys 


Asp 
220 


Leu 


Phe 


Val 


Ser 


Gin 
3 3 5 


Asn 


Phe 


Tyr 


Asp 


Phe 


Leu 


Val 


Phe 





















340 



(2) INFORMATION FOR SEQ ID NO: 167: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1. . .176 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167 



Met 


Thr 


Ser 


Val 


Val 


lie 


Lys 


Pro 


His 


Ala 


Tyr 


Gly Glu Gin Val 


Gin 


1 








5 








10 








15 




Glu 


lie 


Glu 


Glu 


Glu 


Set 


Asp 


Ser 


Asp 


Tyr 


Glu 


Lys Asn 


Asn 


Asp 


Gin 








20 










25 








30 






Glu 


Ala 


He 


Asn 


Phe 


Gly 


He 


Ala 


Leu 


His 


Lys 


Gly Leu 


Glu 


Tyr 


Gin 






35 










40 








45 








Tyr 


Ala 


Tyr 


Asn 


lie 


Pro 


Lys 


Gin 


Ser 


Val 


Leu 


Glu Tyr 


Leu 


Asn 


Tyr 




50 










55 










60 








His 


Tyr Gly 


Phe 


Tyr 


Gly 


Leu 


Asp 


Tyr 


Gin 


Ala 


Leu Glu 


Glu 


Ser 


Leu 


65 










70 










75 








80 


Glu 


Leu 


Phe 


Glu 


Asn 


Asp 


Ala 


Gly 


lie 


Gin 


Ala 


Leu Phe 


Lys 


Asn 


His 










85 










90 








95 




Ala 


Leu 


Lys 


Gly 


Glu 


Ala 


Ala 


Phe 


Leu 


Phe 


Gin 


Gly Val 


Val 


Ser 


Arg 








100 










105 








110 






lie 


Asp 


Val 


Leu 


Leu 


Trp 


Asp 


Arg 


Gly 


Gin 


Asn 


Leu Tyr 


Val 


Leu 


Asp 






115 










120 








125 








Tyr 


Lys 


Ser 


Ser 


Gin 


Asn 


Tyr 


Gin 


Gin 


Ser 


His 


Lys Ala 


Gin 


Val 


Ser 




130 










135 










140 








His 


Tyr 


Ala 


Glu 


Phe 


Leu 


Arg 


Thr 


Gin 


Xaa 


Pro 


His Phe 


Lys 


He 


Gin 


145 










150 










155 








160 


Ala 


Gly 


lie 


lie 


Tyr 


Ala 


His 


Lys 


Arg 


Leu 


Leu 


Glu Lys 


Xaa 


Trp 


Xaa 










165 










170 








175 




(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 168: 















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .260 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168 



Met 


Ser 


Glu 


Asp 


Leu 


Pro 


Phe 


Ala 


Ser 


Asp 


Ser 


Gin 


Phe 


Thr 


Tyr 


Asn 


1 








5 










10 










15 






Vai 


Ssr 


He 


Thr 


Arg 


Trc 


Thr 


Asn 


Glu 


Vd 1 


Asti 


At»p 


Vai 


lie 


Sex 








20 










25 








30 






Gly Val 


Asn 


He 


Thr 


Leu 


Glu 


Gln 


Thr 


Thr 


Glu 


Pro 


Asn 


Lys 


Pro 


Ala 


He 




35 










40 










45 






He 
50 


Ser 


Val 


Ser 


Arg 


Asp 
55 


Asn 


Gin 


Ala 


He 


He 
60 


Asp 


Ser 


Leu 


Lys 


Glu 
65 


Phe 


Val 


Lys 


Ala 


Tyr 

70 


Asn 


Glu 


Leu 


He 


Pro 
75 


Lys 


Leu 


Asp 


Glu 


Asp 
80 


Thr 


Arg 


Tyr 


Asp 


Ala 


Asp 


Thr 


Lys 


He 


Ala 


Gly 


He 


Phe 


Asn Gly 


Val 










85 










90 










95 




Gly Asp 


He 


Arg 


Ala 


lie 


Arg 


Ser 


Ser 


Leu 


Asn 


Asn 


Val 


Phe 


Ser 


Tyr 




Val 


His 


100 










105 










110 




Ser 


Thr 


Asp 


Asn 


Gly 


Val 


Glu 


Ser 


Leu 


Met 


Lys 


Tyr Gly 


Leu 






115 










120 










12 5 








Ser 


Leu 


Asp 


Asp 


Lys 


Gly 


Val 


Met 


Ser 


Leu 


Asp 


Glu 


Ala 


Lys 


Leu 


Ser 




130 










135 








140 








Ser 


Ala 


Leu 


Asn 


Ser 


Asn 


Pro 


Lys 


Ala 


Thr 


Gin 


Asp 


Phe 


Phe 


Tyr 


Gly 


145 


Asp 








150 










155 






160 


Ser 


Ser 


Lys 


Asp 


Met 


Gly 


Gly Arg Glu 


He 


His 


Gin 


Glu 


Gly 


He 


Phe 








165 










170 










175 




Ser 


Lys 


Phe 


Asn 


Gin 


Val 


He 


Ala 


Asn 


Leu 


He 


Asp Gly Gly 


Asn 


Ala 






180 










185 










190 






Lys 


Leu 


Lys 


He 


Tyr 


Glu 


Asp 


Ser 


Leu 


Asp 


Arg 


Asp 


Ala 


Lys 


Ser 


Leu 


Thr 
210 


195 










200 








205 






Lys 


Asp 


Lys 


Glu 


Asn 
215 


Ala 


Gin 


Glu 


Leu 


Leu 
220 


Lys 


Thr 


Arg 


Tyr 


Asn 


He 


Met 


Ala 


Glu 


Arg 


Phe 


Ala 


Ala 


Tyr 


Asp 


Ser 


Gin 


He 


Ser 


Lys 


225 




Gin 






230 










235 










240 


Ala 


Asn 


Lys 


Phe 


Asn 


Ser 


Val 


Gin 


Met 


Met 


He 


Asp 


Gin 


Ala 


Ala 


Ala 








245 










250 








255 




Lys 


Lys 


Asn 
260 

























(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .145 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169 



SUBSTITUTE SHEET (RULE 26) 



WO 97/19098 



PCT/US96/18542 



163 



Met 


Arg 


lie 


Val 


Phe 


Met 


Gly 


Thr 


Pro 


Ser 


Phe 


Ala 


Glu 


Val 


He 


Leu 


1 








5 










10 










15 




Arg 


Ala 


Leu 


Val 


Glu 


Asn 


Glu 


Asp 


Lys 


Lys 


He 


Glu 


Val 


Val 


Gly 


Leu 








20 










25 










30 






Phe 


Thr 


Gin 


Arg 


Asp 


Lys 


Pro 


Phe 


Gly 


Arg 


Lys 


Lys 


Glu 


Leu 


Lys 


Ala 






35 










40 










45 








Pro 


Glu 


Thr 


Lys 


Thr 


Tyr 


lie 


Leu 


Glu 


Asn 


His 


Leu 


Asn 


He 


Pro 


lie 




50 










55 










60 










Phe 


Gin 


Pro 


Gin 


Ser 


Leu 


Lys 


Glu 


Pro 


Glu 


Val 


Gin 


lie 


Leu 


Lys 


Gly 


65 










70 










75 










80 


Leu 


Lys 


Pro 


Asp 


Phe 


lie 


Val 


Val 


Val 


Ala 


Tyr 


Gly 


Lys 


He 


Leu 


Pro 










85 










90 










95 




Lys 


Glu 


val 


Leu 


Thr 


lie 


Ala 


Pro 


Cys 


He 


Asn 


Leu 


His 


Ala 


Ser 


Leu 








100 










105 










110 






Leu 


Pro 


Lys 


Tyr 


Arg 


Gly 


Ala 


Ser 


Pro 


He 


His 


Glu 


Met 


He 


Leu 


Asn 






115 










120 










125 








Asp 


Ac? 


is >-~ 


lie 


rn. > v 

1 J i. 


T . » 


T1 . 




Thr 




Leu 


Ket 


Xaa 




Gly 


He 



130 135 140 



Gly 
145 

(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 248 amino acids 
(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .248 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 170 



Met 


Arg 


Phe 


Tyr 


Phe 


Lys 


Phe 


Leu 


Trp 


Leu 


Leu 


Gly 


He 


Phe 


Leu 


lie 


1 








5 










10 










15 




Phe 


Tyr 


Phe 


Leu 


Asp 


lie 


Lys 


Gly 


Ser 


Ser 


Ser 


Tyr 


He 


Ser 


Asp 


Arg 








20 










25 










30 






Val 


Lys 


Ser 


Ala 


Leu 


Met 


Ser 


Ala 


Lys 


Asn 


Ser 


Leu 


Leu 


Asp 


Asn 


Val 






35 










40 










45 






Gin 


Ala 


Tyr 


Phe 


Phe 


Gin 


Ala 


Gin 


Asn 


He 


Lys 


Glu 


Phe 


Gin 


Lys 


Glu 




50 










55 








60 








Arg 


Leu 


He 


Leu 


Glu 


Ala 


Leu 


Lys 


Leu 


Glu 


Asn 


Ala 


Asp 


Leu 


Lys 


Glu 


65 










70 










75 






80 


Arg 


Leu 


Asn 


Ser 


He 


Tyr 


Pro 


Leu 


Glu 


Asn 


Pro 


Lys 


Met 


Thr 


Tyr 


Thr 










85 










90 










95 




Pro 


Thr 


Phe 


Met 


Thr 


Ser 


Phe 


He 


Asn 


Leu 


Glu 


Asp 


Thr 


His 


Ser 


Val 








100 










105 










110 






Ser 


Leu 


Asn 


Pro 


lie 


Val 


Asn 


Leu 


Glu 


Glu 


Asn 


Lys 


lie 


Tyr 


Gly 


Leu 






115 










120 










125 






Val 


Ser 


His 


Asn 


Gin 


Ala 


lie 


Gly 


lie 


Ala 


Val 


Leu 


Glu 


Lys 


Gly Arg 




130 










135 










140 








Leu 


Asn 


Gly 


Phe 


Leu 


Asn 


Ala 


His 


Lys 


Arg 


Cys 


Ala 


Tyr 


Ser 


Val 


Met 


145 










150 










155 










160 


He 


Gly 


Gin 


Asn 


Gin 


Val 


Leu 


Gly 


Phe 


He 


Gly 


Thr 


Asn 


Phe 


Lys 


Gin 










165 










170 










175 




Glu 


Leu 


Val 


Val 


Asp 


Phe 


He 


Val 


Pro 


Ser 


Ala 


Glu 


lie 


Asn 


lie 


Gly 








180 










185 










190 




Asp 


Gin 


Val 


Leu 


Thr 


Ser 


Gly 


Leu 


Asp 


Gly 


He 


Phe 


Gly Ala 


Gly Val 






195 










200 








205 








Phe 


Val 


Gly Glu 


Val 


Ser 


Ser 


Val 


Glu 


Asp 


His 


Tyr 


Thr 


Tyr 


Lys 


Ser 
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210 215 220 

Ala Val Leu Lys Asn Ala Phe Leu Ser Glu Ala Lys Leu Leu Arg His 
225 230 235 240 

Val Phe Leu Ser Gly Val Lys Asn 
245 

(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

!vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .119 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171 



Leu 


Ala 


Leu 


Arg 


Leu 


Pro 


Phe 


Leu 


He 


Ala 


His 


Val 


He 


Asn 


Met 


Phe 


1 








5 










10 










15 




Leu 


Phe 


Tyr 


Leu 


He 


Gly 


Arg 


Lys 


He 


Leu 


Lys 


Lys 


Pro 


Lys 


Asp 


Ala 








20 










25 










30 




Leu 


Tyr 


Val 


Val 


Leu 


Thr 


Tyr 


Ala 


Leu 


Leu 


Pro 


Gly 


Val 


Asn 


Leu 


Phe 






35 










40 








45 








Ala 


He 


Leu 


Leu 


Ala 


Lys 


Ser 


Val 


Leu 


Val 


Leu 


Ser 


Leu Gly 


Leu 


Leu 




50 










55 










60 










He 


Ser 


Tyr 


Leu 


Tyr 


He 


Lys 


Thr 


Gin 


Lys 


He 


Pro 


Tyr 


Leu 


Thr 


Leu 


65 










70 










75 








80 


Ser 


Ala 


Cys 


Ala 


Phe 


Leu 


Asp 


Gly 


Ala 


Phe 


He 


Pro 


Leu 


Leu 


Leu 


Gly 










85 










90 










95 


Val 


Phe 


Ala 


Tyr 


Ala 


Leu 


Arg 


Lys 


Thr 


Ala 


He 


Leu 


Arg 


Ala 


Arg 


Ser 








100 










105 








110 




Leu 


Leu 


Trp 
115 


Trp 


Phe 


Xaa 


Leu 





















(2) INFORMATION FOR SEQ ID NO:172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1 . . .108 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172 

Val Asn Leu Met Asp Tyr Phe Ser Lys Ser Leu Phe Leu Asn Ser Leu 

15 10 15 

Asn Thr Gin Arg Leu He Val Ser Asn Lys Leu Ala He Asp Val Gin 

20 25 30 

Tyr Gly Met Leu Gin Ser Val Arg Lys Asn Tyr Pro Asp Val Val Asp 
35 40 45 
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Gly 


Gly 


Val 


Arg 


Glu 


Gly 


Pro 


Phe 


Trp 


Val 


Leu 


Ala 


Gly Ala 


Leu 


Met 


50 










55 










60 








Pro 


Ser 


lie 


Leu 


He 


Glu 


He 


Gly 


Tyr 


Asn 


Ser 


His 


Ala He 


Glu 


Ser 


65 










70 








75 








80 


Lys 


Arg 


He 


Gin 


Ser 


Lys 


Pro 


Tyr 


Gin 


Lys 


He 


Leu 


Ala Lys 


Gly 


He 








85 










90 








95 




Ala 


Asp 


Gly 


He 
100 


Asp 


Ser 


Phe 


Phe 


Ser 
105 


Lys 


Asn 


Asp 









(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .157 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173 



Leu 


Ala 


Ser 


Arg 


Tyr 


Ser 


Val 


Ala 


Val 


Gly 


Asn 


Leu 


Phe 


Ser 


Glu 


His 


1 






5 










10 










15 




Leu 


Tyr 


Asp 


Leu 


Arg 


Asn 


Glu 


Thr 


Met 


Thr 


Asn 


Leu 


He 


Gly 


Phe 


Leu 






20 










25 










30 






Leu 


Val 


Leu 
35 


Ala 


Ser 


He 


Trp 


Val 
40 


Phe 


Phe 


Leu 


Ala 


Leu 
45 


Gly 


Val 


Leu 


Leu 


Gly 
50 


Lys 


Met 


Leu 


Val 


Phe 
55 


Ser 


Gly 


Leu 


Gly 


He 
60 


He 


Asp 


Lys 


Ala 


Leu 


Gly 


Phe 


lie 


Phe 


Ser 


Cys 


Leu 


Lys 


Thr 


Phe 


Leu 


Val 


Leu 


Ser 


Phe 


65 








70 










75 










80 


lie 


Leu 


Tyr 


Ala 


Leu 
85 


Ser 


Lys 


Met 


Asp 


Leu 
90 


Met 


Lys 


Asp 


Ala 


Asn 
95 


Ala 


Tyr 


Leu 


Gin 


Glu 


Lys 


Xaa 


Xaa 


He 


Phe 


Pro 


Thr 


Xaa 


Lys 


Xaa 


Xaa 


Xaa 






100 








105 










110 






Ser 


Lys 


He 
115 


Met 


Arg 


Leu 


Asp 


Gly 
120 


Val 


Lys 


His 


Val 


Glu 
125 


Lys 


Asn 


Leu 


Lys 


Asp 


Asn 


Leu 


Glu 


Glu 


Met 


Ser 


Asp 


Glu 


Val 


Lys 


Asn 


Lys 


Gly 


Ser 


130 










135 










140 










He 


Asp Asn 


Ala 


Lys 


Glu 


Ser 


Phe 


Asn 


Lys 


Gly Tyr 


Gly 









145 150 155 



(2) INFORMATION FOR SEQ ID NO: 174: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

lii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 ... 81 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174 
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Leu Ser Lys Gin Ser Ala Asp He 
1 5 
Ser Leu Val Lys Val Leu Ala He 
20 

Trp Gin Asn He Leu Phe Ala Leu 

35 40 
Leu Gly Leu Met Gly Val Ala Ser 

50 55 
Val Gly Val Thr Leu Leu Ala Leu 
g 5 70 
Ala 



Val He Thr Asn Asp Ser Leu Ser 

10 15 
Ala Lys Lys Thr Lys Ser He Thr 
25 30 
Gly lie Lys Ala Val Phe He Val 
45 

Leu Trp Glu Ala Val Phe Gly Asp 
60 

Ala Asn Ser Xaa Arg Thr Met Arc? 

75 so 



(2) INFORMATION FOR SEQ ID NO: 175: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 20 online acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .80 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175 



Met 


Lys 


Asn 


Leu 


Arg 


His 


Phe 


Arg 


Lys 


Leu 


1 








5 






10 


Ser 


Pro 


Leu 


Leu 


Leu 


Gin 


Ala 


Asp 


Met 


Thr 


Glu 






20 








25 




Gin 


Gin 


Leu 


Thr 


Ser 


Pro 


Thr 


Ala 


Lys 


Phe 




35 










40 




Leu Gly 
50 


Leu 


Ala 


He 


Phe 
55 


He Trp Lys 


Glu 
65 


He 


Leu 


Met 


Thr 


Val 
70 


Leu 


Ala 


Leu 


Lys 



15 

Thr Phe Phe Asn Ser He 
30 

Gly He Leu Met Val He 
45 

Asn Leu Asp Arg Trp Lys 
60 

Xaa Val Pro Met Gin Xaa 
75 80 

(2) INFORMATION FOR SEQ ID NO: 176 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 325 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .325 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176 

Leu Ala Gly Leu Xaa Val Gly Cys Xaa Arg Met Lys Gin Thr Phe Tro 

5 io 15 

Xaa Leu Ser Trp Gly Glu Lys Ser Gin Lys Val Cys Val His Arg Pro 
20 25 30 
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Trp 


Tyr 


Ala 
35 


He 


Trp 


Ser 


Cys 


Asp 
40 


Lys 


Trp 


Glu 


Glu 


Lys 
45 


Thr 


Gin 


Gin 


Phe 


Thr 


Gly 


Asn 


Gin 


Leu 


He 


Thr 


Lys 


Thr 


Trp 


Ala 


Gly Gly 


Asn 


Ala 




50 










55 










60 










Ala 


Asn 


Tyr 


Tyr 


His 


Ser 


Gin 


Asn 


Asn 


Gin 


Asp 


He 


Thr 


Ala 


Asn 


Leu 


65 










70 










75 










80 


Lys 


Asn 


Asp 


Asn Gly 


Thr 


Tyr 


Phe 


Leu 


Ser 


Gly 


Leu 


Tyr 


Asn 


Tyr 


Thr 










85 










90 










95 




Gly 


Gly 


Glu 


Tyr 
100 


Asn 


Gly 


Gly 


Asn 


Leu 
105 


Asp 


He 


Glu 


Leu 


Gly 
110 


Ser 


Asn 


Ala 


Thr 


Phe 


Asn 


Leu 


Gly 


Ala 


Ser 


Ser 


Gly Asn 


Ser 


Phe 


Thr 


Ser 


Trp 






115 










120 










125 






Tyr 


Pro 


Asn 


Gly 


His 


Thr 


Asp 


Val 


Thr 


Phe 


Ser 


Ala 


Gly Thr 


He 


Asn 




130 










135 










140 










Val 


Asn 


Asn 


Ser 


Val 


Glu 


Val 


Gly 


Asn 


Ara 


Val 


Gly 


Ser 


Gly 


Ala Gly 


145 










150 










155 










160 


Thr 


His 


Thr 


Gly Thr 


Ala 


Thr 


Leu 


Asn 


Leu 


Asn 


Ala 


Asn 


Lys 


Val 


Thr 










165 










170 








175 




He 


Asn 


Ser 


Asn 


He 


Ser 


Ala 


Tyr 


Lys 


Thr 


Ser 


Gin 


Val 


Asn 


Val 


Gly 








180 










185 










190 




Asn 


Ala 


Asn 


Ser 


Val 


He 


Thr 


He 


Asn 


Ser 


Val 


Ser 


Leu 


Asn 


Gly Glu 






195 










200 










205 








Thr 


Cys 


Xaa 


Ser 


Leu 


Ala 


Arg 


Val 


Gly 


Val 


Gly Ala Asn 


Cys 


Ser 


Thr 




210 










215 










220 








Ser 


Gly 


Pro 


Ser 


Tyr 


Ser 


Phe 


Lys 


Gly 


Thr 


Thr 


Asn 


Ala 


Thr 


Asn 


Thr 


225 










230 










235 










240 


Thr 


Phe 


Ser 


Xaa 


Ser 
245 


Ser 


Gly 


Ser 


Phe 


Thr 
250 


Phe 


Glu 


Glu 


Asn 


Ala 
255 


Thr 


Phe 


Ser 


Gly 


Ala 


Lys 


Leu 


Asn 


Gly 


Gly 


Ala 


Phe 


Thr 


Phe 


Asn 


Lys 


Lys 








260 










265 










270 




Phe 


Asn 


Ala 


Thr 


Asn 


Asn 


Thr 


Ala 


Phe 


Asn 


Ser 


Gly 


Ser 


Phe 


Thr 


Phe 






275 










280 








285 








Lys 


Gly 


Thr 


Ser 


Ser 


Phe 


Asn 


Gly 


Ala 


Asn 


Phe 


Ser 


Asn 


Ala 


Ser 


Tyr 




290 










295 










300 








Thr 


Phe 


Asn 


Asn 


Gin 


Ala 


Thr 


Phe 


Gin 


Asn 


Ser 


Ser 


Phe 


Asn 


Gly Gly 


305 










310 










315 










320 


Thr 


Phe 


Thr 


Phe 


Asn 
325 

























(2) INFORMATION FOR SEQ ID NO: 177: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 
(DJ TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1. . .271 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177 



Leu 


Leu 


Ser 


Leu 


Val 


Lys 


Gly 


Lys 


Thr 


Met Leu 


Arg 


Ser 


Leu 


Tyr 


Ser 


1 








5 










10 








15 




Ala 


Thr 


Ser 


Gly Met 


Leu 


Ala 


Gin 


Gin 


Thr His 


lie 


Asp 


Thr 


Thr 


Ser 








20 










25 








30 






Asn 


Asn 


He 


Ala 


Asn 


Val 


Asn 


Thr 


Thr 


Gly Phe 


Lys 


Lys 


Ser 


Arg 


Ala 






35 










40 








45 






Asp 


Phe 


Asn 


Asp 


Leu 


Phe 


Tyr 


Gin 


Ala 


Met Gin 


Tyr 


Ala 


Gly Thr 


Asn 




50 










55 








60 










Thr 


Ser 


Asn 


Thr 


Thr 


Leu 


Ser 


Pro 


Asp 


Gly Met 


Glu 


Val 


Gly 


Leu 


Gly 


65 










70 






75 










80 
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Val 


Arg 


Pro 


Ser 


A 1 pk 
r\±d 


Tip 

lie 


i nr 


Lys 


rie l 


Phe 


Ser 


Gin 


Gly 


Ser 


Pro 


Lys 


Glu 


Thr 






AS 
0 J 










90 








95 


Glu 


Asn 


Asn 


Leu 


_ 

Asp 


lie 


ai a 


He 


Thr 


biy 


Lys 


Gly 


Phe Phe 








100 










1UD 






110 






Gin 


Val 


Gin 


Leu 


Pro 


Asp 


Vjiy 


TV*** 

i nr 


i nr 


Ala 


Tyr 


inr 


Arg 


Ser 


Gly 


Asn 






115 




















12 5 






Phe 


Lvs 


Leu 


Asp 


Glu 


("51 n 


Vj iy 


Asn 


Leu 


Val 


Thr 


ser 


Glu 


Gly Tyr 


Leu 




13 0 










1 J j 










140 










Leu 


He 


Pro 


fil n 


i i e 


i nr 


Leu 


Pro 


GlU 


Asp 


Thr 


Thr 


Gin 


Val 


Asn 


lie 


145 










i sn 

IjU 








155 










160 


Gly 


Val 


Asp 


\j ± y 


i nr 


vai 


ber 


va l 


Thr 


Gin Gly 


Leu 


Gin 


Thr 


Thr 


Ser 










1 f c 
ID J 










170 










175 




Asn 


Val 


He 


Gly 


Gin 


He 


Thr 


Leu 


Ala 


Asn 


Phe 


v ax 


Asn 


Pro 


Ala 


Gly 




His 




180 










185 










190 




Leu 


Ser 


Met 


Gly 


Asp 


Asn 


Leu 


Phe 


Ser 


He 


Thr 


Asn 


Ala 


Ser 


Gly 






195 










200 










205 






Asp 


Ala 


He 


Val 


Gly 


Asn 


Pro 


Asp 


Ser 


Gin 


Gly 


Leu 


Gly 


Lys 


Leu 


Arg 




1 U 










215 










220 




Gin Gly 


Phe 


Leu 


Glu 


Leu 


Ser 


Asn 


Val 


Arg 


Leu 


Val 


Glu 


Glu 


Met 


Thr 


225 










230 








235 










240 


Asp 


Leu 


He 


Thr 


Ala 


Gin 


Arg 


Ala 


Tyr 


Glu 


Ala 


Asn 


Ser 


Lys 


Ser 


He 










245 










250 








255 




Gin 


Thr 


Ala 


Asp 


Ala 


Met 


Leu 


Gin 


Thr 


Val 


Asn 


Ser 


Leu 


Lys 


Arg 





260 265 270 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc__f eature 

(B) LOCATION 1 . . .90 



(xij SEQUENCE DESCRIPTION: SEQ ID NO: 178 



Met 


Gin 


Asn 


Gly 


Tyr 


Tyr 


Ala 


Ala 


Thr 


Gly 


Ala 


Met 


Ala 


Thr 


Gin 


Phe 


1 








5 










10 










15 




Asn 


Arg 


Leu 


Asp 


Leu 


Thr 


Ser 


Asn 


Asn 


Leu 


Ala 


Asn 


Leu 


Asn 


Thr 


Asn 


Gly 






20 










25 










30 






Phe 


Lys 


Arg 


Asp 


Asp 


Ala 


He 


Thr 


Gly 


Asp 


Phe 


Leu 


Arg 


Leu 


Tyr 


Gin 




35 










40 










45 




Glu 


Tyr 


Arg 


Glu 


Gin 


Leu 


Pro 


Leu 


Glu 


Asp 


Gin 


Thr 


Lys 


Ala 


Ser 




50 










55 








60 








Ala 


Lys 


Tyr 


Leu 


Asn 


Arg 


Xaa 


Leu 


Asn 


Arg 


Val 


Pro 


He 


Leu 


Ser 


Xaa 


65 










70 








75 










80 


He 


Tyr 


Thr 


Xaa 


Arg 


Xaa 


Leu 


Gly Xaa 


Val 















85 90 



(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 
{ B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE : 

(A) NAME/KEY: misc_f eature 

(B) LOCATION I. . .195 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179 



Val 


Glv 


Ala 


Met 


Pro 


Thr 


He 


Gin 


He 


Arg 


Xaa 


Phe 


Gly Ala 


Gly Gly 


1 








5 










10 










15 




Ser 


Glv 


His 


Ser 


Asp 


Ala 


Thr 


Leu 


Met 


Leu 


Val 


Asn 


Gly 


He 


Pro 


V CI X 








20 










25 










30 






i yi 


Met 


Ala 


Pro 


Tvr 


Ala 


His 


He 


Glu 


Leu 


Asp 


He 


Phe 


Pro 


v a x 


Thr 




35 










40 










45 








Phe 


Gin 


Ala 


lie 


Asp 


Arg 


He 


Asp 


Val 


He 


Lys 


Gly 


Gly Gly 


Ser 


Val 




50 










55 










60 










Gin 


Tyr 


G j. y 


Pro 


Asn 


Thr 


Tyr 


Gly 


Gly 


He 


Val 


rvo it 


He 


11* 


Tin 


Ly i> 


65 










70 










75 










80 


Pro 


lie 


Pro 


Asn 


Gin 


Trp 


Glu 


Asn 


Gin 


Ala 


Ala 


Glu 


Arg 


Xaa 


Thr 


Tyr 










85 










90 










95 


Trp 


Ala 


Lys 


Ala 


Arg 


Asn 


Ala 


Gly 


Phe 


Ala 


Ala 


Pro 


Xaa 


Asp 


Lys 


Thr 








100 










105 










110 






Gly 


Asp 


Pro 


Ser 


Phe 


lie 


Lys 


Ser 


Leu 


Gly Asn 


Asn 


Leu 


Leu 


Tyr 


Asn 






115 










120 










125 








Thr 


Tyr 


Val 


Arg 


Ser 


Gly 


Gly 


Met 


lie 


Asn 


Lys 


His 


Val 


Gly 


He 


Gin 




130 










135 










140 










Arg 


Lys 


Leu 


Thr 


Gly 


Leu 


Glu 


Ala 


Lys 


Ala 


Leu 


Gly 


Thr 


He 


Ala 


Pro 


145 










150 










155 










160 


Leu 


Val 


Phe 


Gin 


Thr 


He 


Gly 


Trp 


Met 


Gly 


Ser 


Met 


Thr 


Ser 


Met 


Lys 










165 










170 










175 




Ala 


Met 


Gly 


Leu 


Lys 


Pro 


He 


Thr 


Asn 


Thr 


Thr 


He 


Leu 


Ala 


He 


Xaa 








180 










185 










190 






Gin 


Pro 


Gly 
































195 




























(2) 


INFORMATION 


FOR 


SEQ 


ID NO:180: 

















<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .84 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 



Met 


Arg 


Xaa 


Glu 


Lys 


He 


Met 


Thr 


Asn 


Phe 


Glu 


Lys 


Xaa 


lie 


Ala 


Gin 


1 








5 










10 










15 




Asn 


Arg 


Leu 


Lys 


Thr 


Asn 


Ala 


Val 


Leu 


Thr 


Thr 


Tyr 


Cys 


Ala 


lie 


Phe 








20 










25 










30 






Ala 


Phe 


He 


Gly 


Leu 


Leu 


Val 


Asp 


Ala 


lie 


Arg 


He 


Asn 


Ala 


Asn 


Asp 






35 










40 










45 








Leu 


Gly 


lie 


Ala 


Leu 


Phe 


Lys 


Leu 


Met 


Thr 


Phe 


Gin 


lie 


Phe 


Pro 


Thr 




50 










55 










60 










Xaa 


Thr 


lie 


Val 


Met 


Phe 


Val 


Val 


Ala 


Phe 


Val 


lie 


Xaa 


Xaa 


Ser 


Leu 


65 










70 










75 










80 


Tyr 


Pro 


Lys 


Phe 
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(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_£ eature 

(B) LOCATION 1. . .76 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181 

He He Lys Ser He Glu Ala Leu Asp Asp Tyr 

10 is 
Leu Asn Gly Pro Glu Ala Pro Phe Leu Ala Asn 

25 30 
Leu Ser He Leu Ser Lys Asp Tyr Ala Asp Tyr 

40 45 
Lys Lys Asp Glu Leu Ala Lys Xaa Pro Val Gly 

55 60 
Phe Phe Leu Trp Asn Lys Arg 
70 75 

(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION 1. . .196 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182 



Leu 


Met 


Arg 


Lys 


He 


Phe 


Ser 


Tyr 


He 


Ser 


Lys 


Val 


Leu 


Leu 


Phe 


He 


1 




Val 




5 










10 










15 




Gly Val 


Tyr 


Ala 


Glu 


Pro 


Asp 


Ser 


Lys 


Val 


Glu 


Ala 


Leu 


Glu 


Gly 


Arg 


Lys 


Gin 
35 


20 










25 










30 




Glu 


Ser 


Ser 


Leu 


Asp 
40 


Lys 


Lys 


He 


Arg 


Gin 
45 


Glu 


Leu 


Lys 


Ser 


Lys 
50 


Glu 


Leu 


Lys 


Asn 


Lys 
55 


Glu 


Leu 


Lys 


Asn 


Lys 
60 


Asp 


Leu 


Lys 


Asn 


Lys 
65 


Glu 


Glu 


Lys 


Lys 


Glu 
70 


Thr 


Lys 


Ala 


Lys 


Arg 
75 


Lys 


Pro 


Arg 


Ala 


Glu 
80 


Val 


His 


His 


Gly 


Asp Ala 


Lys 


Asn 


Pro 


Thr 


Pro 


Lys 


He 


Thr 


Pro 


Pro 


Lys 


He 






85 










90 










95 


Lys 


Gly 


Ser 


Ser 


Lys 


Gly 


Val 


Gin 


Asn 


Gin Gly 


Val 


Gin 


Asn 


Asn 


Ala 




100 










105 










110 


Pro 


Lys 


Pro 


Glu 


Glu 


Lys 


Asp 


Thr 


Thr 


Pro 


Gin 


Ala 


Thr 


Glu 






115 










120 










125 




Lys 


Asn 
130 


Lys 


Glu 


Thr 


Ser 


Pro 
135 


Ser 


Ser 


Gin 


Phe 


Asn 
140 


Ser 


He 


Phe 


Gly 



Met Xaa Met Ser His 
1 5 
Thr He Arg Phe Thr 
20 

Leu Glv Met Asp Phe 
35 

Leu Ala Gin Asn Asn 
50 

Thr Gly Pro Phe Lys 
65 
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Asn Pro Asn Asn Ala Thr Asn Asn Thr Leu Glu Asp Lys Val Val Gly 
145 150 155 160 

Gly lie Ser Leu Leu Val Asn Gly Ser Pro lie Thr Leu Tyr Gin lie 

165 170 175 

Gin Glu Glu Gin Glu Lys Ser Lys Val Ser Xaa Ala Xaa Ala Arg Asp 

180 185 190 

Arg Leu Xaa Xaa 
195 



(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .179 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183 



Met Lys Arg Ser 
1 

Val Gly Cys Ser 
20 

Ser Thr Lys Ala 
35 

Glu Lys Glu Glu 
50 

Pro Ala lie Glu 
65 

Asp Lys Tyr Glu 

Val Gin Lys Ala 
100 

Thr Asp Glu Phe 
115 

Arg Thr Leu Ser 
130 

Asp Met lie Lys 
145 

Gin Lys Thr Arg 
Leu Val Lys 



Ser 


Val 


Phe 


Ser 


5 








His 


Lys 


Met 


Asp 


Val 


Gin 


Thr 


Ala 








40 


Pro 


Lys 


Gin 


Glu 






55 




Ser 


Gly 


Thr 


He 




70 






He 


Lys 


Glu 


Ser 


85 








Lys 


Glu 


Asn 


His 


Gly 


Ser 


Ser 


Glu 








120 


Val 


Lys 


Asn 


Ala 






135 




Thr 


He 


Ser 


Phe 




150 






Glu 


Cys 


Tyr 


Arg 


165 









Phe 


Leu 


Val 


Ala 




10 






Asn 


Lys 


Thr 


Val 


25 








Pro 


Val 


Thr 


Thr 


Pro 


Ala 


Pro 


Val 








60 


He 


Ala 


Ser 


He 






75 




Asp 


Gin 


Glu 


Thr 




90 






Met 


Gin 


Val 


Leu 


105 








Tyr 


Asn 


Gin 


Ala 


Leu 


Val 


He 


Lys 








140 


Gly 


Glu 


Ser 


Lys 






155 




Glu 


Asn 


Arg 


Arg 




170 







Phe 


Leu 


Leu 


Val 






15 




Ala 


Gly 


Asp 


Val 




30 






Glu 


Pro 


Ala 


Pro 


45 








Val 


Glu 


Glu 


Lys 


Tyr 


Phe 


Asp 


Phe 








80 


Leu 


Asp 


Glu 


He 






95 




Leu 


Glu 


Gly 


Asn 




110 






Leu 


Gly 


Val 


Lys 


125 








Gly 


Val 


Glu 


Lys 


Pro 


Lys 


Cys 


Val 








160 


Val 


Asp 


Val 


Lys 






175 





(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 
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(A) NAME/ KEY : misc.f eature 

(B) LOCATION 1. . .286 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184 



Met 


Gly 


Thr 


Leu 


He 


Glu 


Lys 


Trp 


Phe Gly 


Phe 


Ser 


Gin 


He 


Ara 


Glu 


1 








5 










10 










15 




Glu 


Leu 


Glu 


Ala 


Arg 


He 


Ser 


Glu 


Leu 


Glu 


Asp 


Glu 


Asn 


Thr 


Glu 


Leu 








20 










25 








30 






Leu 


Arg 


Glu 


Arg 


Glu 


Tyr 


Leu 


Ala 


Ala 


Glu 


Thr 


Ser 


Glu 


Leu 


Lys 


Asp 


Ala 




35 










40 










45 




Asn Asp 


Gin 


Leu 


Arg 


Gin 


Lys 


Asn Asp 


Lvs 


Leu 


Phe 


lie 


Thr 
















55 










60 








Asp 


Lys 


Leu 


Thr 


Lys 


Glu 


Asn 


Thr 


Glu 


Leu 


Phe 


Ala 


Glu 


Asn 


U1U 




65 










70 










75 










80 


Leu 


Cat* 


Vdl 


Lys 


He 


Ser 


Gly 


Leu 


Glu 


His 


Ser 


Asn 


Asp 


Gin 


Leu 


Trp 










85 










90 










95 


r>t .» 


Asn 




aSh 

100 




uc'u 


Thr 


Lvs 


Glu 
105 


Lys 


A 1 a 


Glu 


r ... 


t . ... 

Li 'y S 

110 


Tit . 

i ill 


u 


Lys 


Asp 


He 


Leu 


Ala 


Lys 


Glu 


Asn 


Thr 


Arg 


Leu 


Leu 


Ala 




Arg 


Asp 






115 










120 










± 4 -> 




Arg 


Leu 


Thr 


Glu 


Glu 


Lys 


Ara 


Glu 


Leu 


Thr 


Thr 




Lys 




Arg 


Leu 




130 










135 










140 






Lys 


Arg 


Glu 


Asn 


Thr 


Glu 


Leu 


Thr 


His 


Lys 


Tie 


1 nr 


VjXU 


Leu 


Tnr 


Lys 


145 










150 










15 5 










1 0 (J 


Glu 


Asn 


Lys 


Ala 


Leu 


Thr 


Thr 


Glu 


Asn 


Asp 


Lvs 


Leu 


Asn 


His 


0111 


Val 


Thr 


Ala 






165 










170 










175 




Leu 


Thr 


Asn 


Glu 


Arg 


Asp 


Ser 


Leu 


Glu 


Gin 


Glu 


Arg 


Ala 


Arg 








180 










185 










190 




Leu 


Gin 


Asp 


Ala 


His 


Gly 


Phe 


Leu 


Glu 


Lys 


Arg 


Cys 


Thr 


Asn 


Leu 


Glu 




Glu 


195 










200 








205 








Lys 


Asn 


Gin 


Arg 


Leu 


Thr 


Asp 


Lys 


Leu 


Lys 


Gin 


Leu 


Glu 


Ser 


Ala 


Gin 


210 










215 








220 










Lys 


Ser 


Leu 


Glu 


Asn 


Thr 


Asn 


Asn 


Gin 


Leu 


Arg 


Gin 


Ala 


Leu 


Glu 


225 










230 










235 








240 


Asn 


Ser 


Asn 


Val 


Gin 


Leu 


Ala 


Gin 


Ala 


Lys 


Glu 


Xaa 


He 


Ala 


He 


Glu 


Xaa 








245 










250 










255 




Ser 


Glu 


Leu 


Xaa 


Arg 


Arg 


Asn 


Arg 


Thr 


Leu 


Glu 


Glu 


Leu 


Arg 


Gly 








260 










265 










270 


Tyr Gly 


Ser 


Gin 


Lys 


Xaa 


He 


Trp 


Thr 


Tyr 


Thr 


Xaa 


Gly Val 










275 










280 










285 









(2) INFORMATION FOR SEQ ID NO: 185 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1 . . .110 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185 

Val Leu Arg Lys Leu Leu Gly Lys Asn Cys He Glu Thr His Lys Gly 

15 10 15 

Val Gly Tyr Arg Leu Thr His Tyr Glu Lys Lys Ser Leu Lys Leu Phe 

20 25 30 

Leu Gly Thr Tyr Leu Gly Ser Ser Phe Val Leu Met Leu Val He Ser 

35 40 45 

Val Leu Ala Phe Asn Tyr Glu Lys Asn Glu Lys He Lys Xaa He Arg 
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50 55 
Met Asp Met Asp Lys Met Ala Ser Lys 
65 70 
Leu His Met Gin Thr His Ala Asp Tyr 
85 

lie Ser Arg Tyr Lys Asp Val Ser He 
100 105 

(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .124 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 



Leu 


Met 


Thr 


Lys 


Ser 


Leu 


Lys 


Leu 


He 


Gin 


Lys 


Gly Val 


Lys Asn 


Leu 


1 






5 










10 








15 




Tyr 


Glu 


Thr 


Leu 


Lys 


Asn 


Arg 


Ala 


Leu 


Glu 


His 


Gin 


Asp 


Thr Leu 


Met 






20 








25 










30 




Val 


Gly 


Arg 


Ser 


His 


Gly 


Val 


Phe 


Gly 


Glu 


Pro 


He 


Thr 


Phe Gly 


Leu 




35 










40 










45 






Val 


Leu 


Ala 


Leu 


Phe 


Ala 


Asp 


Glu 


He 


Lys 


Arg 


His 


Leu 


Lys Ala 


Leu 




50 










55 










60 








Asp 


Leu 


Thr 


Met 


Glu 


Phe 


He 


Xaa 


Val 


Gly 


Ala 


He 


Ser Gly Ala 


Met 


65 










70 










75 








80 


Gly Asn 


Phe 


Ala 


His 


Ala 


Pro 


Leu 


Glu 


Leu 


Glu 


Glu 


Leu 


Ala Cys 


Gly 










85 










90 








95 




Phe 


Leu 


Gly 


Leu 


Lys 


Thr 


Ala 


Asn 


He 


Ser 


Asn 


Gin 


Val 


He Gin 


Arg 






100 








105 










110 




Asp 


Arg 


Tyr 


Ala 


Gly 


Leu 


His 


Ala 


He 


Trp 


Leu 


Phe 









115 120 
(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187 

Leu Ser Asp Ala Ser Lys Arg Ser Leu Asn Pro Thr Leu Met Met Asn 
15 10 15 

Asn Asn Asn Thr Leu Pro Lys Pro Leu Glu Glu Ser Leu Asp Leu Lys 
20 25 30 



60 

He Ala Ser Glu He He Gin 

75 80 
His Asn Ala Leu Asn Ala Leu 
90 95 
Xaa Leu Xaa Asp Thr 
110 
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Glu Phe lie Ala Leu Phe Lys Thr Phe Phe Ala Lys Glu Arg Gly Ser 

35 40 45 

He Ala Leu Glu Asn Asp Leu Lys Gin Ala Phe Thr Tyr Leu Asn Glu 

50 55 go 

Val Asp Ala lie Gly Leu Pro Ala Pro Xaa Lys Arg Glu Arg Lys Arg 

5 70 75 80 

Ser Tyr Cys Cys Gin Thr His Gin He Arg Asp Ala Pro Phe Arg 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 188: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc feature 

(B) LOCATION 1. . .80 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188 

Leu Pro He lie Leu Xaa Val He Val Met Met Phe Phe Ser Lys He 

5 10 15 

Val Gly Asp Phe lie Glu Lys His Tyr Arg Val Lys Thr Leu Ala Phe 

20 25 30 

Val Phe Leu Leu Val Val Gly Val Phe Leu Phe Leu Glu Gly Leu His 

35 40 45 

Leu His He Asn Lys Asn Tyr Leu Tyr Ala Gly He Gly Phe Ala Leu 

Leu He Glu Cys Leu Xaa He Phe lie Glu Lys Lys Met Lys Lys Ser 
65 70 75 80 

(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .265 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189 

Met He Lys Ala Arg Phe Lys Lys Arg Leu Leu Gly Ser Arg Gly Ala 

1 ' 5 io 15 

Phe Asp Leu Asn He Asp Leu Glu He Lys Glu Ala Glu Val Val Ala 

Leu Leu Gly Glu Ser Gly Ala Gly Lys Ser Thr He Leu Arg He Leu 

35 40 45 

Ala Gly Leu Glu Ala Val Ser Ser Gly Tyr He Glu Ala Asn His Ser 

-*° 55 go 

Val Trp Leu Asp Thr Gin Lys Lys He Phe Leu Lys Pro Gin Gin Arg 
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65 










70 






75 










80 


Lys 


He 


Gly 


Phe 


Val 


Phe 


Gin Asp Tyr 


Ala 


Leu 


Phe 


Pro 


His 


Leu 


Asn 






85 






90 










95 




Val 


Tyr 


Gin 


Asn 


He 


Ala 


Phe Ala His 


Pro 


Lys 


Asp 


Lys 


Asn 


Lys 


He 






100 






105 










110 






His 


Glu 


Val 


Leu 


Arg 


Leu 


Met Arg Leu 


Glu 


Asn 


Leu 


Ser 


Gin 


Gin 


Lys 






115 








120 








125 








He 


Pro 


Lys 


Leu 


Ser 


Gly 


Gly Gin Ala 


Gin 


Arg 


Val 


Ala 


Leu 


Ala 


Arg 




130 










135 






140 










Ala 


Leu 


He 


Ala 


Ala 


Lys 


Asn Leu Leu 


Leu 


Leu 


Asp 


Glu 


Pro 


Leu 


Asn 


145 










150 






155 










160 


Ala 


Leu 


Asp 


Asn 


Ala 


Leu 


Lys Asn Glu 


Val 


Gin 


Gin 


Gly 


Leu 


Leu 


Asp 








165 




170 










175 




Phe 


He 


Lys 


Arg 


Glu 


Asn 


Leu Ser Val 


Leu 


Leu 


Val 


Ser 


His 


Asp 


Pro 






180 






185 










190 






Asn 




Tip 

lie 


i. fix 


L*y s 


L»eu 


Ala Arg Thr 


Phe 


Leu 


Phe 


Leu 


Asn 


Asn 


Glv 






195 








200 








205 








Val 


He 


Asp 


Pre 


Acn 


Cln 


Glu Acr. Arg 


Leu 


Phc 


Scr 


At^ 


Arg 




Leu 




210 










215 






220 










Val 


Lys 


Pro 


Leu 


Phe- 


Glu 


Asp Glu Asn Tyr 


Cys 


His 


Tyr 


Glu 


Val 


He 


225 








230 






235 










240 


Pro 


Gin 


Thr 


He 


Ser 


Leu 


Pro Lys Asp Cys 


Leu 


Asn 


Pro 


Thr 


Phe 


Lys 










245 






250 










255 




Leu 


Asp 


Phe 


He 


Gin 


Asn 


Lys Lys Phe 






















260 






265 
















(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 190: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1. . .64 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190 



Val 


Lys 


Phe 


Ser 


Val 


Leu 


Thr Leu Phe Pro 


Gin Leu lie 


Leu 


Pro 


Tyr 


1 






5 




10 






15 




Phe 


Glu 


Asp 


Ser 


He 


Leu 


Lys Arg Ala Leu 


Glu Lys Asn 


Leu 


Phe 


Glu 








20 






25 




30 






Leu 


Glu 


Val 


Leu 


Asn 


Leu 


Arg Asp Phe Ser 


Ala Asn Lys 


Tyr 


Gin 


Lys 






35 








40 


45 








Ala 


Xaa 


Ser 


His 


Ala 


His 


Trp Trp Gly Cys 


Gly Ala Asn 


Phe 


Arg 


Pro 




50 










55 


60 








(2) 


INFORMATION 


FOR 


SEQ 


ID NO:191: 











(ij SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 138 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
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(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .138 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 



Leu 


Trp 


Arg 


Thr 


Pro 


Lys 


i nr 


pro 


Leu 


Val 


lie 


1 








5 










1 u 


Ser 


Met 


Ser 


Asp 


Ser 


Glu 


He 


Phe 


Ala 


Xaa 


Xaa 








20 










25 






Ser 


Val 


Xaa 


Gly 


Pro 


Val 


Leu 


Ala 


Gly 


Tyr 


Ala 






35 










40 






Leu 


Pro 


Tyr 


Leu 


He 


Ala 


Ala 


Ser 


Phe 


Met 


Ser 




50 










55 






Leu 


Phe 


Ala 


Lys 


Thr 


He 


Tyr 


Pro 


Gin 


Asn 


Glu 


65 










70 










75 


Ala 


Asp 


Val 


Ser 


Ala 

OS 


Glu 


Glu 


His 


Val 


Asn 

O A 


He 


Xaa Gly Ala 


Ser 


Thr 


Gly 


Xaa 


His 


Leu 


y \J 

Ala 


Leu 








100 










105 






Leu 


Leu 


Ala 


Phe 


Val 


Gly 


Met 


Val 


Ala 


Leu 


Val 






115 










120 








Val 


Val 


Gly 


Gly 


Phe 


Leu 


Gly 


Met 


Glu 


His 






130 










135 











15 

Cys Val Gly Met Al< 
30 

Ser Met Gly lie Pr< 
45 

Ala Pro Gly Gly Lei 
60 

Thr He Ser Ser His 
80 

He Glu Ala Xaa Ale 

OZ 

His Val Gly Ala Met 
110 

Asn Gly Leu Leu Gl} 
125 



(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 
<B) LOCATION 1 . . .116 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192 

Val Cys Asn Val Va: 
15 

Pro Val Glu Gly Al< 
30 

Ser Phe Tyr Gly Pre 
45 

Tyr Ala Ala lie Asr 
60 

Trp Tyr Ser Leu Pru 
80 

Ser His Tyr Ser Asf 
95 

Glu Gly Asp Trp Tr% 
110 



Val 


Met 


Asn 


Phe 


Phe 


Val 


Gly 


Gly 


Leu 


Ser 


He 


1 


He 






5 








10 




Val 


Thr 


Tyr 


Ser 


Ala 


Leu 


His 


Pro 


Thr 


Ala 


Glu 






20 










25 






Asp 


lie 


Val 


Gin 


Val 


Ser 


His 


His 


Leu 


Thr 


Ala 




35 










40 








Thr Gly 


Leu 


Leu 


Phe 


Gly 


Phe 


Thr 


Tyr 


Leu 


His 


50 










55 








Thr 


Phe 


Gly 


Leu 


Asp 


Trp 


Arg 


Pro 


Tyr 


Ser 


65 










70 






75 


Val 


Ala 


He 


Asn 


Thr 


Val 


Pro 


Ala 


Ala 


He 


Leu 


Met 








85 










90 




Leu 


Asp 


Asp 
100 


His 


Lys 


Val 


Leu 


Gly 
105 


He 


Thr 


Ala 


He 


■He 
115 


Xaa 

















(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .227 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 



vai 


Leu 


Leu 




Lys 


H i c 

n x s 


Cnv- 

Od 




Ala 
Ala 


Gly Leu Leu Ser Ala Leu 


Xaa 


1 






c 
J 










10 15 




Ala 


Leu 


Ser 


Phe 


Gly 


bet 


\ji,y 


V a JL 


Vol 


Ser He Gin Ala Leu Glu 


Cys 








z u 










30 




GlU 


lie 


'i'nr 




Asn 


Asn 


Lys 


Pro 


Leu 


Glu Leu Vai Phe Cys Glu 






35 










40 




45 




Phe 


Pro 


Lys 


Lys 


Leu 


Ser 


Ala 


Phe 


Ala 


Leu Gly Met Gly Leu Glu Asn 




50 






55 






60 




He 


Pro 


Lys 


Asp 


Phe 


Lys 


Lys 


Trp 


Leu 


Glu Leu Ala Pro Cys Val 


Leu 


65 








70 








75 


80 


Asp 


Ala 


Gly 


Val 


Phe 


Tyr 


His 


Lys 


Glu 


Val Leu Gin Ala Leu Glu 


Lys 








85 










90 95 




Glu 


Val 


He 


Leu 


Thr 


Pro 


His 


Pro 


Lys 


Glu Phe Leu Ser Leu Leu 


Lys 








100 










105 


110 




Ser 


Val 


Gly 


He 


Asn 


He 


Ser 


Met 


Leu 


Glu Leu Leu Asp Asn Lys 


Leu 






115 










120 




125 




Glu 


He 


Ala 


Arg 


Asp 


Phe 


Ser 


Gin 


Lys 


Tyr Pro Lys Val Val Leu 


Leu 




130 










135 






140 


He 


Leu 


Lys 


Gly 


Ala 


Asn 


Thr 


Leu 


lie 


Ala 


His Gin Gly Arg Val Phe 


145 






150 








155 


160 


Asn 


Asn 


Leu 


Gly 


Ser 


Val 


Ala 


Leu 


Ala 


Lys Ala Gly Ser Gly Asp Val 








165 










170 175 




Leu 


Ala 


Gly 


Leu 


He 


Val 


Ser 


Leu 


Leu 


Ser Gin Asn Tyr Thr Pro 


Leu 






180 










185 


190 




Xaa 


Ala 


Ala 


He 


Asn 


Ala 


Ser 


Leu 


Ala 


His Ala Leu Ala Gly Leu 


Xaa 






195 










200 




205 




Phe 


Lys 


Asn 


Xaa 


Xaa 


Ala 


Leu 


Thr 


Pro 


Xaa Asp Leu He Glu Lys 


Xaa 




210 










215 






220 




Lys 


Arg 


Leu 



















225 



(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1 . . . 109 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194 

Val Xaa Leu Tyr Leu Ala Leu Thr Leu Ser Leu Gly He Ala Met Leu 

1 5 10 15 

Leu Val Glu Met Leu He Gly Asn Leu Gly Lys Lys Asp Val Val Ser 

20 25 30 

Asn Tyr Gin He Leu Asp Pro Lys Arg Lys Lys Tyr Tyr Pro Phe Thr 
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3 5 40 
Ser Phe Phe lie Leu Gly Gly Pro Leu 

50 55 
Val Leu Gly Trp Val Leu Tyr Tyr Leu 
65 70 
Pro Lys Asp Leu Xaa Gin Ala Lys Met < 
85 

Gly Ser Leu He Trp Pro Val He Asp 
100 105 

(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(iii MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<vi) 



(ix) 



(xi) 



Leu 
1 


Thr 


Thr 


Lys 


Ala 

5 


Cys 


Trp 


Leu 


Leu 


Leu 


Asn 


He 


Thr 


He 


Lys 


Asp 


Arg 


Thr 








20 










25 


Lys 


Asp 


Phe 


Ala 


Trp 


Lys 


Lys 


Cys 


Phe 






35 










40 




Leu 


Leu 


Val 


Gly 


Cys 


Ser 


Pro 


His 


He 




50 










55 






Leu 


Lys 


Leu 


Asn 


Tyr 


His 


Pro 


Ala 


Ser 


65 










70 








Glu 


Lys 


He 


Leu 


Leu 


Leu 


Arg 


Pro 


Ala 



85 



Cys 

(2) INFORMATION FOR SEQ ID NO: 196: 
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45 

He Leu Ser Phe Tyr Ala Val 
60 

Phe Val Val Thr Phe Asp Leu 

75 80 
Gin Phe Xaa Met Leu Gin Asn 
90 95 
Phe Ser Ala Cys 



Arg Val Cys Cys Tyr Arg Ser 
10 15 
Met Lys Thr Asn Gly His Phe 
30 

Leu Gly Ala Ser Val Val Ala 
45 

lie Glu Thr Asn Glu Val Ala 
60 

Glu Lys Val Gin Ala Leu Asp 

75 80 
Phe Gin Tyr Ser Xaa Asn He 
90 95 



ORIGINAL SOURCE: 
(A) ORGANISM: Helicobacter pylori 

FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .97 

SEQUENCE DESCRIPTION: SEQ ID NO: 195 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1. . .145 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 

Leu Ser Glu Trp Gin Thr Phe Cys Leu Lys Asp Leu Gly Lys He Val 
15 10 15 

Gly Gly Ala Thr Pro Pro Thr Asn Asn Pro Lys Asn Tyr Gly Asn Lys 
20 25 30 
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He Ala Trp He Thr Pro Lys Asp Leu Ser Thr Leu Gin Gly Arg Tyr 

35 40 45 

He Lys Lys Gly Ser Arg Ser He Ser Arg Leu Gly Phe Lys Ser Cys 

50 55 60 

Ser Cys Val Leu Leu Pro Lys His Ala He Leu Phe Ser Ser Arg Ala 
65 70 75 80 

Pro He Gly Tyr Val Ala He Ala Glu Lys Arg Leu Cys Thr Asn Gin 

85 90 95 

Gly Phe Lys Ser He He Pro Asn Lys Lys He Tyr Phe Glu Phe Leu 

100 105 HO 

Tyr Tyr Leu Leu Lys Tyr Tyr Lys Asp Asn He Ser Asn He Gly Gly 

115 120 125 

Gly Thr Thr Phe Lys Glu Val Ser Gly Ala Thr Leu Gly Ser He Pro 
130 135 140 

Ser 
145 

(2} INFORMATION FOR SEQ ID NO: 137: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1. . .273 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 



Met 


Glu 


Phe 


Met 


Lys 


Lys 


Phe 


Val 


Ala 


Leu 


Gly 


Leu 


Leu 


Ser 


Ala 


Val 


1 








5 








10 










15 




Leu 


Ser 


Ser 


Ser 


Leu 


Leu 


Ala 


Glu 


Gly 


Asp 


Gly 


Val 


Tyr 


He 


Gly Thr 








20 










25 










30 




Thr 


Asn 


Tyr 


Gin 


Leu 


Gly 


Gin 


Ala 


Arg 


Leu 


Asn 


Ser 


Asn 


He 


Tyr 


Asn 




35 










40 










45 




Thr 


Ala 


Gly 


Asp 


Cys 


Thr 


Gly 


Ser 


Val 


Val 


Gly 


Cys 


Pro 


Pro 


Gly 


Leu 


50 










55 










60 


His 






Tyr 


Asn 


Lys 


His 


Asn 


Pro 


Gly 


Gly 


Thr 


Asn 


He 


Asn 


Trp 


Ser 


Lys 


65 








70 










75 










80 


Ala 


Asn 


Gly 


Ala 


Leu 


Asn 


Gly 


Phe 


Gly 


Leu 


Asn 


Val 


Gly 


Tyr 


Lys 


Lys 








85 








90 










95 


Phe 


Phe 


Phe 


Gin 


Phe 


Lys 


Ser 


Leu 


Asp 


Met 


Thr 


Ser 


Lys 


Trp 


Phe 


Gly 








100 








105 










110 




Gin 


Arg 


Val 


Tyr 


Gly 


Leu 


Phe 


Asp 


Tyr 


Gly 


His 


Ala 


Asp 


Leu 


Gly 


Lys 




115 










120 










125 








Val 


Tyr 


Ala 


Pro 


Asn 


Lys 


He 


Gin 


Leu 


Asp 


Met 


Val 


Ser 


Trp Gly Val 




130 










135 










140 




Ala 




Phe 


Gly 


Ser 


Asp 


Leu 


Leu 


Ala 


Asp 


He 


He 


Asp 


Lys 


Asp 


Asn 


Ser 


145 








150 










155 










160 


Gly 


He 


Phe 


Gly 


Gly 


Val 


Ala 


He 


Gly 


Gly 


Asn 


Thr 


Trp 


Lys 


Ser 


Ser 






165 










170 










175 




Ala 


Ala 


Asn 


Tyr 


Trp 


Lys 


Glu 


Gin 


He 


He 


Glu 


Ala 


Lys 


Gly 


Pro 


Asp 








180 








185 










190 






Val 


Cys 


Thr 


Pro 


Thr 


Tyr 


Cys 


Asn 


Pro 


Asn 


Ala 


Pro 


Tyr 


Ser 


Thr 


Asn 




195 










200 










205 


Val 




Ala 


Thr 


Ser 


Thr 


Val 


Ala 


Phe 


Gin 


Val 


Trp 


Leu 


Asn 


Phe 


Gly 


Arg 




210 










215 








220 




Val 






Asn 


He 


Tyr 


Lys 


His 


Asn 


Gly 


Val 


Glu 


Phe 


Gly 


Val 


Arg 


Pro 


Leu 


225 






230 










235 




Thr 






240 


Leu 


He 


Asn 


Lys 


Phe 


Leu 


Ser 


Ala 


Gly 


Pro 


Asn 


Ala 


Asn 


Leu 


Tyr 








245 










250 










255 
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Tyr His Leu Lys Arg Asp Tyr Ser Leu Tyr Leu Gly Tyr Asn Tyr Thr 
260 265 270 

Phe 



(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) OKUANISM: Helicobacter pyluii 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .148 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198 



Leu 


Val 


Gin 


lie 


Val 


Val 


Val 


Phe 


Tyr 


Gly 


Leu 


Pro 


Ala 


Leu 


Gly 


Val 


1 








5 










10 










15 




Tyr 


Met 


Asp 


Pro 


lie 


Pro 


Ala 


Gly 


He 


He 


Ala 


Phe 


Ser 


Phe 


Asn 


Val 








20 










25 










30 






Gly 


Ala 


Tyr 


Ala 


Ser 


Glu 


Thr 


Leu 


Arg 


Ala 


Ser 


Phe 


Leu 


Ser 


Val 


Pro 






35 










40 










45 








Lys 


Asp 


Gin 


Trp 


Asp 


Ser 


Ser 


Leu 


Ser 


Leu 


Gly 


Leu 


Asn 


Tyr 


Leu 


Gin 




50 










55 








60 








Thr 


Phe 


Trp 


His 


Val 


lie 


Phe 


Phe 


Gin 


Ala 


Leu 


Lys 


Val 


Ala 


Thr 


Pro 


65 










70 










75 








80 


Ser 


Leu 


Ser 


Asn 


Thr 


Phe 


He 


Ser 


Leu 


Phe 


Lys 


Glu 


Thr 


Ser 


Leu 


Ala 










85 










90 










95 




Ser 


Val 


Val 


Thr 


He 


Ala 


Glu 


Xaa 


Phe 


Arg 


He 


Ala 


Gin 


Gin 


Lys 


Xaa 








100 










105 










110 




Asn 


Val 


Ser 


Tyr 


Asp 


Phe 


Xaa 


Pro 


He 


Tyr 


Leu 


Glu 


Xaa 


Ala 


Leu 


He 






115 










120 










125 








Tyr 


Trp 


Leu 


Phe 


Cys 


Leu 


Val 


Leu 


Glu 


Val 


He 


Gin 


Lys 


Arg 


Val 


Glu 



130 135 140 



Lys He Leu Asn 
145 

(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_£ eature 

(B) LOCATION 1. . .134 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199 

Val Val Ala Asp Glu Val Arg Asn Leu Ala Gly Arg Thr Gin Lys Ser 

1 ' 5 ■ 10 15 

Leu Ala Glu He Asn Ser Thr lie Met Val He Val Gin Glu He Asn 
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20 25 30 

Asp Val Ser Ser Gin Met Asn Leu Asn Ser Gin Lys Met Glu Arg Leu 

35 40 45 

Ser Asp Met Ser Lys Ser Val Gin Glu Thr Tyr Glu Lys Met Ser Ser 

50 55 60 

Asn Leu Ser Ser Val Val Leu Asp Ser Asn Gin Ser Met Asp Asp Tyr 
65 70 *?5 80 

Ala Lys Ser Gly His Gin lie Glu Ala Met Val Ser Asp Phe Ala Glu 

85 90 95 

Val Glu Lys Val Ala Ser Lys Thr Leu Ala Asp Ser Ser Asp lie Leu 

100 105 HO 

Asn He Ala Thr His Val Ser Gly Thr Thr Met Asn Leu Xaa Lys Gin 

115 120 125 

Val Asn Leu Phe Lys Thr 
130 

(2) INFORMATION FOR SEQ ID NO: 200; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1 . . .13 3 



15 

Leu Ala He Leu Met Va] 
30 

He Ala Leu Pro Ser Gl> 
45 

Val He Glu He Arg Met 
60 

Gin Thr Tyr Glu Tyr Xac 
75 80 
Lys Lys Tyr Asp Lys Asj 
95 

Leu Thr Tyr Asp Lys Va] 
110 

Gly Phe Leu Lys Val Se] 
125 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 





(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO 


Met 


Asn 


Tyr 


Asp 


Asn 


Tyr 


Trp 


Asp 


Glu 


Asp ! 


1 






5 










10 


Thr 


Pro 


Leu 


Val 


Asp 


Val 


Met 


Leu 


Val 


Leu i 








20 










25 


Glu 


Thr 


Thr 


Pro 


Thr 


Leu 


Thr 


Tyr 


Lys 


Glu 






35 










40 






Ser 


Lvs 


Thr 


Ala 


Arg 


Ala 


Thr 


Gin 


Asp 


Lys 




50 










55 








Asp 


Lys 


Asp 


Ala 


Lys 


He 


Tyr 


He 


Asp 


Ser i 


65 










70 










Ser 


Phe 


Pro 


Asp 


Thr 


Phe 


Asn 


Leu 


Leu 


Ser 








85 










90 


Thr 


Arg 


Val 


Ser 


He 


Arg 


Ala 


Asp 


Lys 


Arg 






100 










105 




He 


Tyr 


Leu 


Leu 


Lys 


Thr 


He 


Lys 


Glu 


Ala 




115 










120 






Leu 


He 


Thr 


Ser 


Pro 














130 


















(2) 


INFORMATION 


FOR 


SEQ 


ID : 


NO:201; 





SUBSTITUTE SHEET (RULE 26) 



WO 97/19098 



PCT/US96/18542 



182 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .71 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201 



Met 


Pro 


Pro 


Thr 


Xaa 


Pro 


Gin 


Ala 


Ser 


He 


Leu 


Arg 


Leu 


Thr 


Leu 


Lys 


1 








5 










10 










15 


Asn 


Pro 


Leu 


Xaa 


Xaa 


Leu 


Ser 


Arg 


Tyr 


Ser 


Leu 


Cys 


Leu 


Leu 


Lys 


Lys 








20 










25 










30 


Thr 


Arg 


Leu 


Gin 


Thr 


Thr 


Ser 


Asn 


Ser 


Ala 


Pro 


Lys 


Ala 


Cys 


Leu 


He 






35 










40 








45 




Ala 


Gly Leu 


Leu 


Lys 


Lys 


Ser 


Lys 


Pro 


Phe 


He 


Leu 


Asn 


Thr 


Leu 


Lys 




50 










55 










60 








He 


Arg 


Ser 


Leu 


Leu 


Lys 


Pro 



















65 70 
(2) INFORMATION FOR SEQ ID N0.2C2. 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 217 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .217 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202 



Met 


Pro 


Val 


He 


Arg 


Val 


Leu 


Val 


Met 


Leu 


Ala 


Thr 


Met 


Met 


Met 


Lys 


1 








5 










10 










15 


Leu 


Val 


Lys 


Thr 


Ala 


Lys 


Glu 


Lys 


Lys 


Val 


Phe 


Lys 


Asn 


Val 


Gly 


lie 








20 










25 










30 




Ser 


He 


Met 


Gly 


He 


Ala 


Phe 


Trp 


Glu 


Ala 


He 


Lys 


Asp 


Ser 


He 


Lys 




Gin 


35 










40 










45 






Lys 


He 


Lys 


Lys 


Ser 


Asp 


Trp 


He 


Cys 


Gly 


Asn 


Val 


Lys 


Thr 


Ala 




50 










55 










60 








Asp 


Asp 


Tyr 


Leu 


Lys 


Thr 


His 


Pro 


Asn 


Ser 


Trp 


Phe 


Asn 


Ser 


Ala 


lie 


65 


Val 








70 










75 










80 


Gly 


Thr 


Ala 


He 


Thr 


Ala 


Met 


Leu 


Met 


Asn 


Val 


Cys 


Phe 


Ala 


Asp 




Gin 






85 










90 








95 


Asp 


Ser 


Lys 


Lys 


Glu 


Val 


Ala 


Gin 


Ala 


Gin 


Lys 


Glu 


Ala 


Glu 


Asn 


Ala 






100 










105 








110 




Arg 


Asp 


Arg 


Ala 


Asn 


Lys 


Ser 


Gly 


lie 


Glu 


Leu 


Glu 


Gin 


Glu 


Glu 


Gin 




115 










120 










125 






Lys 


Thr 


Glu 


Gin 


Glu 


Lys 


Gin 


Lys 


Thr 


Glu 


Gin 


Glu 


Lys 


Gin 


Lys 


Thr 


130 










135 










140 






Glu 


Gin 


Glu 


Lys 


Gin 


Lys 


Thr 


Glu 


Gin 


Glu 


Lys 


Gin 


Lys 


Thr 


Glu 


145 


Glu 








150 










155 






160 


Gin 


Lys 


Gin 


Lys 


Thr 


Ser 


Asn 


lie 


Glu 


Thr 


Asn 


Asn 


Gin 


lie 


Lys 


Val 








165 










170 










175 


Glu 


Gin 


Glu 


Gin 


Gin 


Lys 


Thr 


Glu 


Gin 


Glu 


Lys 


Gin 


Lys 


Thr 


Asn 




Thr 




180 










185 










190 






Asn 


Gin 


Lys 


Asp 


Leu 


Val 


Asn 


Lys 


Ala 


Glu 


Gin 


Asn 


Cys 


Gin 


Glu 




His 


195 










200 










205 






Asn 


Asn 


Gin 


Phe 


Phe 


lie 


Lys 


Asn 
















210 










215 



















(2) INFORMATION FOR SEQ ID NO:203: 
(ii SEQUENCE CHARACTERISTICS: 



SUBSTITUTE SHEET (RULE 26) 



WO 97/19098 



PCT/US96/18542 



183 

(A) LENGTH: 7 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ixj FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .75 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203 



Met 


val 


lie ber 


Gly 


His 


Phe 


Thr Thr 


Tyr 


Ser 


Tyr 


He 


Glu Flu 


Phe 


1 






5 








10 








15 




He 


He 


Gin He 


Ser 


Gin 


Phe 


Ser Pro 


Asp 


lie 


Thr 


Thr 


Leu Met 


Leu 




20 








25 










30 




Phe 


Val 


Phe Gly 


Leu 


Ala 


Gly 


Val Val 


Gly 


Ser 


Phe 


Leu 


Phe Gly 


Arg 






35 








40 








45 






Leu 


Tyr 


Ala Lys 


Asn 


Ser 


Arg 


Lys Phe 


lie 


Ala 


Phe 


Ala 


Met Val 


Leu 




50 








55 








60 








Val 


He 


Cys Pro 


Gin 


Pro 


Leu 


Ala. Phe 


Cys 


Val 










65 






70 








75 










(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 204 : 















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
<B) LOCATION 1. . .192 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:204 



Met 


Lys 


Ser 


Thr 


Arg 


He 


Gly 


Ser 


Lys 


lie 


Val 


Met 


Met Val 


Cys 


Ala 


1 






5 










10 










15 




Val 


Val 


He 


Val 


lie 


Ser 


Ala 


Val 


Met 


Gly 


Val 


lie 


lie 


Ser 


Tyr 


Lys 








20 










25 










30 






Val 


Glu 


Ser 


Val 


Leu 


Gin 


Ser 


Gin 


Ala 


Thr 


Glu 


Leu 


Leu 


Gin 


Lys 


Lys 






35 










40 










45 






Phe 


Ala 


Gin 


Leu 


Val 


Ser 


Phe 


Lys 


He 


Gin 


Gly 


lie 


Met 


Lys 


Arg 


He 




50 










55 










60 








Ala 


Met 


Gly Ala 


Asn 


Thr 


Leu 


Glu 


Arg 


Phe 


Leu 


Ser 


Asp 


Glu 


Asn 


Gly 


65 










70 










75 










80 


lie 


Asn 


Asp 


Thr 


Leu 


Lys 


Arg 


Arg 


Met 


Leu 


Ser 


Glu 


Phe 


Leu 


Leu 


Ala 








85 










90 










95 




Asn 


Pro 


His 


Val 


Leu 


Leu 


Val 


Ser 


Ala 


lie 


Tyr 


Thr 


Asn 


Asn 


Asn 


Glu 








100 










105 










110 






Arg 


Met 


lie 


Thr 


Ala 


Met 


Asn 


Met 


Asp 


Ser 


Lys 


He 


Ala 


Tyr 


Pro 


Asn 




115 










120 










125 








Thr 


Ala 


Leu 


Asn 


Glu 


Asn 


Met 


Thr 


Xaa 


Pro 


lie 


His 


Ser 


Leu 


Lys 


Ser 




130 










135 










140 








lie 


lie 


Thr 


Arg 


Ser 


Xaa 


Pro 


Tyr 


Tyr 


Lys 


Glu 


Val 


Asn 


Xaa 


Xaa 


Lys 


145 








150 










155 










160 


Tyr 


Xaa 


Xaa 


Xaa 


He 


Thr 


Leu 


Pro 


Leu 


Xaa 


Xaa 


Lys 


Asn 


Xaa 


Asn 


Xaa 
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165 170 175 

lie Xaa Xaa Leu Asn Phe Xaa Leu Asn lie Asp Xaa Phe Leu Tyr Xaa 
180 185 190 

(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 53 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

fix) FEATURE: 

(A) NAME /KEY : misc.feature 
{B) LOCATION 1. . .253 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205 



Met 


Ala 


Tyr 


Lys 


Tyr 


Asp 


Arg 


Asp 


Leu 


Glu 


Phe 


Leu 


Lys 


Gin 


Leu 


Glu 


1 








5 










10 








15 




Ser 


Ser 


Asp 


Leu 


Leu 


Asp 


Leu 


Phe 


Glu 


Val 


Leu 


Val 


Phe 


Gly 


Lys 


Asp 








20 










25 










30 


Gly 


Glu 


Lys 


Arg 


His 


Asn 


Glu 


Lys 


Leu 


Thr 


Ser 


Ser 


lie 


Glu 


Tyr 


Lys 






35 










40 










45 




Arg 


His 


Gly 


Asp 


Asp 


Tyr 


Ala 


Lys 


Tyr 


Ala 


Glu 


Arg 


He 


Ala 


Glu 


Glu 




50 










55 










60 










Leu 


Gin 


Tyr 


Tyr 


Gly 


Ser 


Asn 


Ser 


Phe 


Ala 


Ser 


Phe 


He 


Lys 


Gly 


Glu 


65 










70 










75 






80 


Gly 


Val 


Leu 


Tyr 


Lys 


Glu 


He 


Leu 


Cys 


Asp 


Val 


Cys 


Asp 


Lys 


Leu 


Lys 










85 










90 










95 


Val 


Asn 


Tyr 


Asn 


Lys 


Lys 


Thr 


Glu 


Thr 


Thr 


Leu 


lie 


Glu 


Gin 


Asn 


Met 








100 










105 










no 






Leu 


Ser 


Lys 


He 


Leu 


Glu 


Arg 


Ser 


Leu 


Glu 


Glu 


Met 


Asp 


Asp 


Glu 


Glu 






115 










120 










125 






Val 


Lys 


Glu 


Met 


Cys 


Asp 


Glu 


Leu 


Ser 


He 


Lys 


Asn 


Thr 


Asp 


Asn 


Leu 




130 










135 










140 








Asn 


Arg 


Gin 


Ala 


Leu 


Ser 


Ala 


Ala 


Thr 


Leu 


Thr 


Leu 


Phe 


Lys 


Met 


Gly 


145 










150 










155 








160 


Gly 


Phe 


Lys 


Ser 


Tyr 


Gin 


Leu 


Ala 


Val 


lie 


Val 


Ala 


Asn 


Ala 


Val 


Ala 










165 










170 










175 




Lys 


Thr 


He 


Leu 


Gly 


Arg 


Gly 


Leu 


Ser 


Leu 


Ala 


Gly Asn 


Gin 


Val 


Leu 








180 










185 










190 






Thr 


Arg 


Thr 


Leu 


Ser 


Phe 


Leu 


Thr 


Gly 


Pro 


Val 


Gly Trp 


He 


He 


Thr 


Gly 




195 










200 










205 








Val 


Trp 


Thr 


Ala 


He 


Asp 


He 


Ala 


Gly 


Pro 


Ala 


Tyr 


Arg 


Val 


Thr 




210 










215 










220 






He 


Pro 


Ala 


Cys 


He 


Val 


Val 


Ala 


Thr 


Leu 


Arg 


Leu 


Lys 


Thr 


Gin 


Gin 


225 










230 










235 








240 


Ala 


Asn 


Glu 


Asp 


Lys 


Lys 


Ser 


Leu 


Gin 


He 


Glu 


Ser 


Val 









245 250 



(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .293 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206 



Leu 


Leu 


Leu 


Phe 


He 


Val 


Val 


He 


Thr 


Ser 


Leu 


Val 


Lys 


Asn 


Thr 


lie 


1 








5 










10 










15 




Pro 


Asn 


He 


Trp 
20 


Leu 


Thr 


Lys 


He 


Leu 
25 


Tyr 


Met 


Ala 


lie 


Leu 
30 


Leu 


Cys 


Ala 


He 


Ala 


His 


Ser 


Val 


Gly 


Xaa 


He 


Leu 


Arg 


Trp Tyr 


Val 


Ser 


Gly 






35 










40 










45 








His 


Ser 


Pro 


Trp 


Ser 


Asn 


Ala 


Tyr 


Glu 


Ser 


Met 


Phe 


Tyr 


lie 


Ala 


Trp 


50 








55 










60 










Ala 


Ser 


Val 


He 


Ala 


Gly 


Phe 


Val 


Leu 


Arg 


Xaa 


Lys 


Leu 


Ala 


Leu 


Ser 


65 








70 










75 






His 




30 


Ala 


Ser 


Ser 


Phe 


Leu 


Ala Gly 


lie 


Ala 


Leu 


Phe 


Val 


Ala 


Leu 


Gly 








85 










90 










95 




Phe 


Met 


Asp 


Pro 


Gin 


He 


Gly 


Pro 


Leu 


Val 


Pro 


Val 


Leu 


Lys 


Ser 


Tyr 






100 










105 










110 






Trp 


Leu 


Asn 


lie 


His 


Val 


Ser 


Val 


lie 


Thr 


Ala 


Ser 


Tyr Gly 


Phe 


Leu 




115 










120 










125 








Gly 


Leu 


Cys 


Phe 


Val 


Leu 


Gly 


lie 


Leu 


Ser 


Leu 


Val 


Leu 


Phe 


lie 


Leu 


130 








135 










140 






lie 




Arg 


Lys 


Gin 


Gly 


Arg 


Phe 


Asn 


Leu 


Asp 


Lys 


Thr 


He 


Leu 


Ser 


Ser 


145 








150 










155 










160 


Ala 


He 


Asn 


Glu 


Met 
165 


Ser 


Met 


He 


Leu 


Gly 
170 


Leu 


Phe 


Met 


Leu 


Thr 
175 


Ala 


Gly 


Asn 


Phe 


Leu 


Gly 


Gly Val 


Trp 


Ala 


Asn 


Glu 


Ser Trp Gly Arg Tyr 






180 










185 










190 




Val 


Trp Gly 


Trp 


Asp 


Pro 


Lys 


Glu 


Thr 


Trp 


Ala 


Leu 


He 


Ser 


He 


Cys 






195 










200 










205 








Tyr 


Ala 


Leu 


He 


Leu 


His 


Leu 


Arg 


Phe 


Leu 


Gly 


Ser 


Gin 


Asn 


Trp 


Pro 


210 










215 










220 










Phe 


He 


Leu 


Ala 


Ser 


Ser 


Ser 


Val 


Leu 


Gly 


Phe 


Tyr 


Ser 


Val 


Leu 


Met 


225 










230 










235 










240 


Thr 


Leu 


Phe 


Trp 


Arg 


Glu 


Leu 


Leu 


Pro 


Phe 


Trp 


Leu 


Ala 


Gin 


Leu 


Cys 






245 










250 










255 




Arg 


Arg 


Xaa 


Ser 


Phe 


Ala 


Asp 


Pro 


Tyr 


Phe 


Phe 


He 


Leu 


Phe Gly 


Ser 




260 










265 










270 






Asp 


Thr 


Phe 


Arg 


Ser 


Arg 


He 


Leu 


Ala 


Tyr 


Phe 


Lys 


Arg 


His 


Leu 


Ser 




275 










280 










285 








Leu 


Pro 
290 


Lys 


Leu 


Val 

























(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1. . .142 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:207 

Val Glu Met lie His Thr Gin Asp Tyr lie Lys Met Glu Glu Ala Ala 
15 10 15 
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Thr 


Glu 


Ala 


He 


Lys 


Arg 


Lys 


Glu 


Ser 


Ser 


lie 


Tyr 


Leu 


Gly 


Met 


Asp 








0 n 
z U 










25 










30 






Lieu 


Lys 


Asn 


biy 


Ala 


Asp 


Ala 


Leu 


lie 


Ser 


Ala 


Gly 


His 


Ser 


Gly 






J j 










40 










45 






Ala 


Thr 
50 


Met 


Glv 


Leu 


Ala 


Thr 
55 






r .on 




ax y 

60 


lie 


ijys 


uiy 


vol 


Glu 


Arg 


Pro 


Ala 


He 


Cys 


Thr 


Leu 


Met 


Pro 


Ser 


Val 


Gly 


Lys 


Arg 


Pro 


65 










70 










75 








80 


Ser 


Val 


Leu 


Leu 


Asp 


Ala 


Gly 


Ala 


Asn 


Thr 


Asp 


Cys 


Lys 


Pro 


Glu 


Tyr 










85 










90 










95 


Leu 


He 


Asp 


Phe 
100 


Ala 


Leu 


Met 


Gly 


Tyr 
105 


Glu 


Tyr 


Ala 


Lys 


Ser 
110 


Val 


Leu 


His 


Tyr 


Asp 


Ser 


Pro 


Lys 


Val 


Gly 


Leu 


Leu 


Ser 


Asn 


Gly 


Glu 


Glu 


Asp 






115 










120 










125 






He 


Lys 


Gly 


Gly 


He 


Arg 


Ser 


Leu 


Lys 


Lys 


Arg 


He 


Lys 


Cys 







130 135 140 



U) INFORMATION FOK SiiO. ID No: 2 Go: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 144 amino acids 
(BJ TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .144 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208 



Met 


Leu 


Glu 


He 


Lys 


Asn 


Leu 


Asn 


Cys 


Val 


Leu 


Asn 


Ser 


His 


Phe 


Ser 


1 








5 










10 










15 




Leu 


Gin 


Asn 


He 


Asn 


He 


Ser 


Leu 


Ser 


Tyr 


Ser 


Glu 


Arg 


Val 


Ala 


lie 








20 










25 










30 






Val 


Gly Glu 


Ser 


Gly 


Ser 


Gly 


Lys 


Ser 


Ser 


lie 


Ala 


Asn 


Leu 


Val 


Met 






35 










40 










45 








Arg 


Leu 


Asn 


Pro 


Arg 


Phe 


Lys 


Ser 


His 


Asn 


Gly Glu 


lie 


Leu 


Phe 


Glu 




50 










55 










60 










Thr 


Thr 


Asn 


Leu 


Leu 


Lys 


Glu 


Ser 


Glu 


Ala 


Phe 


Xaa 


Gin 


His 


Leu 


Arg 


65 










70 










75 










80 


Gly Asn 


He 


lie 


Ala 


Tyr 


lie 


Ala 


Gin 


Asp 


Pro 


Leu 


Ser 


Ser 


Leu 


Asn 










85 










90 










95 




Pro 


Leu 


His 


Lys 


lie 


Gly 


Lys 


Gin 


Met 


Ser 


Glu 


Ala 


Tyr 


Phe 


Leu 


His 








100 










105 










110 






His 


Lys 


Asn 


Ala 


Ser 


Gin 


Val 


Ser 


Leu 


Asn 


Glu 


Gin 


Val 


Leu 


Asn 


Val 






115 










120 










125 








Met 


Lys 


Gin 


Val 


Gin 


Leu 


Asp 


Glu 


Asn 


Phe 


Trp 


Asn 


Val 


Ser 


Leu 


Met 



130 135 140 



(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 
(BJ TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
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187 



(ix) FEATURE: 

(A) NAME/ KEY : misc.feature 

(B) LOCATION 1 ... 83 



(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 209 



Met 


Asn Tyr 


Lys 


Val 


Ala 


Ser Ala Arg Asn 


He 


Ala 


Thr 


Leu 


Leu 


Phe 


1 






5 




10 










15 




Leu 


Phe Phe 


Ser 


Gin 


Ser 


Glu Ala Phe Asp 


Leu 


Gly 


Lys 


He 


Ala 


Lys 






20 






25 






30 




lie 


Lys Ala 


Gly 


Ala 


Glu 


Ser Phe Ser Lys 


Val 


Gly 


Phe 


Asn 


Asn 


Lys 




35 








40 






45 






Pro 


He Asn 


Xaa 


Asn 


Lys 


Gly He Tyr Pro 


Thr 


Glu 


Thr 


Phe 


Met 


Thr 




50 








55 




60 










He 


Asn Gly 


Leu 


His 


Ala 


Gly Gly Phe Tyr Gly 


Ala 


Leu 


Ala 


Gin 


Lys 


65 








70 




75 










80 


Arg 


Tyr Gly 






















(2) 


INFORMATION 


FOR 


SEQ 


ID NO:210: 















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



{ii> MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A J ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : mi sc_ feature 

(B) LOCATION 1 . . .130 



(xi) SEQUENCE DESCRIPTION: SEQIDNO:210 



Met 


Asp 


Ala 


Leu 


Glu 


He 


Thr 


Gin 


Lys 


Leu He 


Ser 


Tyr 


Pro 


Thr 


He 


1 








5 










10 






15 




Thr 


Pro 


Lys 


Glu 


Cys 


Gly 


He 


Phe 


Glu 


Tyr lie 


Lys 


Ser 


Leu 


Phe 


Pro 








20 










25 






30 






Ala 


Phe 


Lys 


Thr 


Leu 


Glu 


Cys 


Glu 


Lys 


Asn Gly Val 


Lys 


Asn 


Leu 


Phe 






35 










40 








45 








Leu 


Tyr 


Arg 


He 


Phe 


Asn 


Pro 


Leu 


Lys 


Lys His 


Ala 


Glu 


Lys 


Glu 


His 




50 










55 








60 








Ala 


Lys 


Glu 


Lys 


His 


Val 


Lys 


Glu 


Asn 


Val Xaa 


Pro 


Leu 


His 


Phe 


Cys 


65 










70 








75 










80 


Xaa 


Ala 


Gly 


His 


He 


Xaa 


Val 


Val 


Pro 


Pro Gly 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 










85 










90 








95 




Asp 


Ser 


Phe 


Xaa 


Xaa 


He 


lie 


Lys 


Glu 


Gly Phe 


Leu 


Tyr 


Gly Arg 


Gly 








100 










105 








110 




Ala 


Gin 


Asp 


Met 


Lys 


Gly 


Gly 


Val 


Gly Xaa Phe 


Xaa 


Arg 


Cys 


Xaa 


Xaa 



115 120 125 



Lys Phe 
130 

(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 40 amino acids 
(B> TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 
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Ivi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

<ix) FEATURE: 

(A) NAME /KEY : misc„f eature 

(B) LOCATION 1. . .340 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211 



Met 


lie 


Leu 


Ser 


He 


Glu 


Ser 


Ser 


Cys 


Asp 


Asp 


Ser 


Ser 


Leu 


Ala 


Leu 


1 








5 










10 










15 




Thr 


Arg 


lie 


Glu 


Asp 


Ala 


Lys 


Leu 


He 


Ala 


His 


Phe 


Lys 


He 


Ser 


Gin 








20 










25 








30 






Glu 


Lys 


His 


His 


Ser 


Ser 


Tyr 


Gly Gly 


Val 


Val 


Pro 


Glu 


lie 


Ala 


Ser 






35 










40 










45 








Arg 


Leu 


His 


Ala 


Glu 


Asn 


Leu 


Pro 


Leu 


Leu 


Leu 


Glu 


Arc 


Val 


Lys 


lie 




50 










55 










60 






Ser 


Leu 


Asn 


Lys 


Asp 


Phe 


Ser 


Lys 


lie 


Lys 


Ala 


lie 


Ala 


lie 


Thr 


Asn 


65 










70 










75 










80 


Gin 


Pro 


Gly 


Leu 


Ser 


Val 


Thr 


Leu 


lie 


Glu 


Glv 


Leu 


Met 


Met 


Ala 












85 










90 










95 


Ala 


Leu 


Ser 


Leu 


Ser 


Leu 


Asn 


Leu 


Pro 


Leu 


lie 


Leu 


Glu 


Asp 


His 


Leu 








100 










105 










110 






Arg 


Gly 


His 


Val 


Tyr 


Ser 


Leu 


Phe 


He 


Asn 


Glu 


Lys 


Gin 


Thr 


Arg 


Met 






115 










120 








125 






Pro 


Leu 


Ser 


Val 


Leu 


Leu 


Val 


Ser Gly 


Glv 


His 


Ser 


Leu 


lie 


Leu 


Glu 




130 










135 










140 










Ala 


Arg 


Asp 


Tyr 


Glu 


Asp 


He 


Lys 


lie 


Val 


Ala 


Thr 


Ser 


Leu 


Asp 


Asp 


145 










150 










155 








160 


Ser 


Phe 


Gly 


Glu 


Ser 


Phe 


Asp 


Lys 


Val 


Ser 


Lys 


Met 


Leu 


As D 


Leu 


Gly 










165 










170 










175 


Tyr 


Pro 


Gly 


Gly 


Pro 


He 


Val 


Glu 


Lys 


Leu 


Ala 


Leu 


Asp 


Tyr 


Ala 


His 








180 










185 








190 






Pro 


Asn 


Glu 


Pro 


Leu 


Met 


Phe 


Pro 


lie 


Pro 


Leu 


Lys 


Asn 


Ser 


Pro 


Asn 






195 










200 








205 








Leu 


Ala 


Phe 


Ser 


Phe 


Ser 


Gly 


Leu 


Lys 


Asn 


Ala 


Val 


Arg 


Leu 


Glu 


Val 




210 










215 










220 








Glu 


Lys 


Asn 


Ala 


His 


Asn 


Leu 


Asn 


Asp 


Glu 


Val 


Lys 


Gin 


Lys 


lie 


Gly 


225 










230 










235 






240 


Tyr 


His 


Phe 


Gin 


Ser 


Ala 


Ala 


He 


Glu 


His 


Leu 


lie 


Gin 


Gin 


Thr 


Lys 










245 










250 










255 


Arg 


Tyr 


Phe 


Lys 


He 


Lys 


Arg 


Pro 


Lys 


He 


Phe 


Gly 


lie 


Val 


Gly 


Gly 








260 










265 










270 


Ala 


Ser 


Gin 


Asn 


Leu 


Ala 


Leu 


Arg 


Lys 


Ala 


Phe 


Glu 


Asp 


Leu 


Cys 


Ala 






275 










280 










285 






Glu 


Phe 


Asp 


Cys 


Glu 


Leu 


Val 


Leu 


Ala 


Pro 


Leu 


Glu 


Phe 


Cys 


Ser 


Asp 




290 










295 










300 






Asn 


Ala 


Ala 


Met 


He 


Gly 


Arg 


Ser 


Ser 


Leu 


Glu 


Ala 


Tyr 


Gin 


Lys 


Lys 


305 










310 










315 






320 


Arg 


Phe 


He 


Pro 


Leu 


Glu 


Lys 


Ala 


Asp 


lie 


Ser 


Pro 


Arg 


Thr 


Leu 


Leu 










325 










330 










335 




Lys 


Asn 


Phe 


Glu 
































340 


























(2) 


INFORMATION 


FOR 


SEQ 


It) NO:212: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
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(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .168 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:212 



Met 


Leu 


Ser 


Ser 


Asn 


Asp 


Leu 


Phe 


Met 


Val 


Val 


Leu 


Gly Ala 


He 


Leu 


1 








5 










10 










15 




Leu 


Val 


Leu 


Val 


Cys 


Leu 


Val 


Gly 


Tyr 


Leu 


Tyr 


Leu 


Lys 


Glu 


Lys 


Glu 








20 










25 










30 






Phe 


Tyr 


His 


Lys 


Met 


Arg 


Arg 


Leu 


Glu 


Lys 


Thr 


Leu 


Asp 


Glu 


Ser 


Tyr 






35 










40 










45 








Gin 


Glu 


Asn 


Tyr 


Leu 


Tyr 


Ser 


Lys 


Arg 


Leu 


Arg 


Glu 


Leu 


Glu Gly Arg 




50 










55 










60 










Leu 


Glu 


Gly 


Leu 


Ser 


Leu 


Glu 


Lys 


Ser 


Ala 


Lys 


Glu 


Asp 


Ser 


Ser 


Leu 


65 










70 










75 










80 


Lys 


Thr 


Thr 


Leu 


Ser 


His 


Leu 


Tyr 


Asn 


Gin 


Leu 


Gin 


Glu 


He 


Gin 


Lys 










65 










90 










55 




Ser 


Met 


Asp 


Lys 


Glu 


Arg 


Asp 


Tyr 


Leu 


Glu 


Glu 


Lys 


He 


He 


Xaa 


Xaa 








100 










105 










110 






Lys 


Thr 


Xaa 


Xaa 


Lys 


Thr 


Trp 


Gly 


lie 


Met 


Pro 


Leu 


Ala 


Met 


Lys 


Ser 






115 










120 










125 








Thr 


Glu 


Lys 


Gin 


Val 


Leu 


Lys 


Met 


Tyr 


Gin Glu Gly 


Tyr 


Ser 


Val 


Asp 




130 










135 










140 










Ser 


lie 


Ser 


Lys 


Glu 


Phe 


Lys 


Val 


Ser 


Lys 


Gly Glu 


Val 


Glu 


Phe 


He 


145 










150 










155 










160 


Leu 


Asn 


Met 


Ala 


Gly 


Leu 


Lys 


Trp 



















165 



(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .121 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213 



Leu 


Asp 


Pro 


Phe 


Ser 


His 


Lys 


Glu 


Asn 


Phe 


Leu 


Ala 


Val 


Glu 


Thr 


Phe 


i 








5 










10 










15 




Lys 


Met 


Leu 


Gly 


Lys 


Thr 


Glu 


Ser 


Lys 


Asp 


Asn 


Leu 


Asn 


Trp 


Met 


tie 








20 










25 










30 






Ala 


Leu 


He 


He 


Glu 


Lys 


Asp 


Lys 


Val 


Tyr 


Glu 


Gin 


Val 


Gly 


Ser 


Val 






35 










40 










45 








Arg 


Phe 


Val 


Val 


Val 


Val 


Ala 


Ser 


Ala 


He 


Met 


Val 


Leu 


Ala 


Leu 


He 




50 










55 










60 










He 


Ala 


He 


Thr 


Leu 


Leu 


Met 


Arg 


Ala 


He 


Val 


Ser 


Asn 


Arg 


Leu 


Glu 


65 










70 










75 










80 


Val 


Val 


Ser 


Ser 


Thr 


Leu 


Ser 


His 


Phe 


Phe 


Lys 


Leu 


Leu 


Asn 


Asn 


Gin 










85 










90 










95 




Xaa 


His 


Ser 


Ser 


Xaa 


Xaa 


Lys 


Leu 


Val 


Xaa 


Ala 


Arg 


Ser 


Asn 


Asp 


Glu 








100 










105 










110 






Leu 


Gly Arg 


Xaa 


Gin 


Thr 


Xaa 


Asp 


Xaa 

















115 120 



(2) INFORMATION FOR SEQ ID NO: 214: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .149 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 14 



Met 


Glu 


Phe 


Tyr 


Gin 


Val 


Tyr 


Asp 


Pro 


Leu 


Gly 


His 


He 


Tip 


Leu 


3ei 


1 








5 










10 










15 




Ala 


Leu 


Val 


Ala 


Leu 


Ser 


Pro 


He 


Ala 


Leu 


Phe 


Phe 


He 


Ser 


Leu 


lie 








20 










25 










30 






Val 


Phe 


Lys 


Leu 


Lys 


Gly 


Tyr 


Ser 


Ala 


Gly 


Phe 


Leu 


Ser 


Leu 


Ala 


Leu 






35 










40 










45 








Ser 


lie 


Leu 


He 


Ala 


Leu 


Phe 


Val 


Tyr 


Lys 


Met 


Pro 


Val 


Gin 


Met 


Val 




50 










55 








60 










Ser 


Ala 


Ser 


Phe 


Phe 


Tyr 


Gly 


Phe 


Leu 


Tyr 


Gly 


Leu 


Trp 


Pro 


He 


Ala 


65 










70 










75 








80 


Trp 


He 


Val 


He 


Ala 


Ala 


He 


Phe 


Leu 


Tyr 


Asn 


Leu 


Ser 


Val 


Lys 


Ser 










85 










90 










95 




Gly Tyr 


Phe 


Glu 


He 


Leu 


Lys 


Glu 


Ser 


lie 


Leu 


Ser 


Leu 


Thr 


Pro 


Asp 








100 










105 










110 




His 


Arg 


He 


Leu 


Val 


lie 


Leu 


He 


Gly 


Phe 


Cys 


Phe Gly 


Ser 


Phe 


Leu 






115 










120 










125 








Xaa 


Gly 


Ala 


Xaa 


Gly 


Phe 


Gly 


Gly 


Pro 


Val 


Ala 


He 


Thr 


Ala 


Ala 


He 



130 135 140 



Leu Val Ala Leu Gly 
145 

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .325 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215 



Met 


Lys Arg 


lie 


Leu 


Val 


Ser 


Leu 


Ala 


Val 


Leu 


Ser 


His 


Ser Ala 


His 


1 








5 










10 








15 




Ala 


Val 


Lys 


Thr 


His 


Asn 


Leu 


Glu 


Arg 


Val 


Glu 


Ala 


Ser 


Gly Val 


Ala 








20 










25 










30 




Asn 


Asp 


Lys 


Glu 


Ala 


Pro 


Leu 


Ser 


Trp 


Arg 


Ser 


Lys 


Glu 


Val Arg 


Asn 






35 










40 










45 






Tyr 


Met 


Gly 


Ser 


Arg 


Thr 


Val 


He 


Ser 


Asn 


Lys 


Gin 


Leu 


Thr Lys 


Ser 




50 










55 










60 






Ala 


Asn 


Gin 


Ser 


lie 


Glu 


Glu 


Ala 


Leu 


Gin 


Asn 


Val 


Pro 


Gly Val 


His 


65 










70 










75 






80 


He 


Arg 


Asn 


Ser 


Thr 


Gly 


He 


Gly 


Ala 


Val 


Pro 


Ser 


lie 


Ser lie 


Arg 
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65 








Gly 


Phe 


Gly 


Ala 


Gly 


Gly 


Pro 


Gly 








100 










Val 


Asn 


Gly 


He 


Pro 


lie 


Tyr 


Val 






115 










120 


Val 


He 


Phe 


Pro 


Val 


Thr 


Phe 


Gin 




130 










135 




Lys 


Gly 


Gly Glu 


Ser 


Val 


Arg 


Tyr 


145 










150 






He 


Asn 


He 


He 


Thr 


Lys 


Gly 


He 










165 








Ser 


Glu 


Arg 


Thr 


Thr 


Phe 


Trp 


Gly 








180 










Asn 


Gin 


Asn 


Ser 


Lys 


Asn 


He 


Asp 






195 










200 


Phe 


Asn 


Thr 


Tyr 


Leu 


Arg 


Thr 


Gly 




210 










215 




lie 


Gin 


Ala 


Gin 


Vai 


Asn 


Trp 


Leu 


225 










230 






Ser 


Pro 


Thr 


Asp 


He 


Gin 


Asn 


Tyr 










245 








Asn 


Asp 


Ser 


Asn 


Lys 


He 


Thr 


Ala 








260 










Leu 


Thr 


Asp 


Pro 


Gly 


Ser 


Leu 


Gly 






275 










280 


Phe 


Gin 


Asn 


Asn 


Arg 


Pro 


Asn 


Asn 




290 










295 




Trp Gly 


Ala 


Val 


Tyr 


Gin 


Asn 


Phe 


305 










310 






Gly Gly 


Phe 


His 


Phe 
















325 









191 





90 










95 




His 


Ser 


Asn 


Thr 


Gly 


Met 


lie 


Leu 


105 










110 






Ala 


Pro 


Tyr 


Val 


Glu 


lie Gly 


Thr 










125 








Ser 


Val 


Asp 


Arg 


lie 


Ser 


Val 


Thr 








140 










Gly 


Pro 


Asn 


Ala 


Phe Gly Gly Val 






155 










160 


Pro 


Thr 


Asn 


Trp 


Glu 


Ser 


Gin 


Val 




170 










175 




Lys 


Ser 


Glu 


Asn 


Gly Gly 


Phe 


Phe 


185 










190 






Lys 


Ser 


Leu 


Val 


Asn 


Asn 


Met 


Leu 








205 








Gly Met 


Met 


Asn 


Lys 


His 


Phe 


Gly 








220 










Lys 


Giy 


Giii Gly 


Phe Ax y 


Tyr 


Asn 






235 










240 


Met 


Leu 


Asp 


Ser 


Leu 


Tyr 


Gin 


He 




250 










255 




Phe 


Phe 


Gin 


Tyr 


Tyr 


Ser 


Tyr 


Phe 


265 










270 






lie 


Ala 


Ala 


Tyr 


Asn 


Gin 


Asn 


Arg 










285 








Asp 


Lys 


Ser 


Gly 


Arg 


Ala 


Lys 


Arg 








300 










Phe Gly 


Asp 


Thr 


Asp 


Arg 


Val 


Gly 






315 










320 



(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .252 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216 



Leu 


Arg 


Ser 


He 


Ser 


Arg 


He 


Lys 


Met 


Leu 


Ser 


Val 


Tyr 


Glu 


Lys 


Gly 


1 








5 










10 










15 




Asn 


Ala 


Leu 


Asp 


Lys 


Arg 


Val 


Leu 


Glu 


Glu 


Trp 


Leu 


Leu 


Ser 


Glu 


Asp 








20 








25 










30 






lie 


Leu 


Met 


Glu 


Asn 


Ala 


Ala 


Met 


Ala 


Leu 


Glu 


Arg 


Ala 


Val 


Leu 


Gin 






35 










40 










45 








Asn 


Ala 


Ser 


Leu 


Gly 


Ala 


Lys 


Val 


lie 


He 


Leu 


Cys 


Gly 


Ser 


Gly Asp 




50 










55 










60 










Asn 


Gly 


Gly 


Asp 


Gly 


Tyr 


Thr 


Leu 


Ala 


Arg 


Arg 


Leu 


Val 


Gly Arg 


Phe 


65 










70 










75 










80 


Lys 


Thr 


Leu 


Val 


Phe 


Glu 


Met 


Lys 


Leu 


Ala 


Lys 


Ser 


Pro 


Met 


Cys 


Gin 








85 










90 










95 




Leu 


Gin 


Lys 


Glu 


Arg 


Ala 


Lys 


Lys 


Val 


Gly 


Val 


Val 


He 


Lys 


Ala 


Trp 








100 










105 










110 






Glu 


Glu 


Lys 


Asn 


Glu 


Asp 


Leu 


Glu 


Cys 


Asp 


Val 


Leu 


Val 


Asp 


Cys 


Vai 






115 










120 










125 








Val 


Gly 


Ser 


Ala 


Phe 


Lys 


Gly 


Gly 


Leu 


Glu 


Pro 


Phe 


Leu 


Asp 


Phe 


Glu 
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130 135 140 



Ser 


Leu 


Ser 


Gin 


Lys 


Ala 


Arg 


Phe 


Lys 


He 


Ala 


Cys 


Asp 


He 


Pro 


Ser 


1 A C 










150 










155 










160 


Gly 


lie 


Asp 


Ser 


Lys 


Gly Arg 


Val 


Asp 


Lys 


Arg 


Ala 


Phe 


Lys 


Xaa 


Gly 










165 










170 










175 




Tyr 


Arg 


Leu 


Ser 


Ala 


Trp 


Ala 


Leu 


Phe 


Lys 


Ser 


Cys 


Leu 


Leu 


Ser 


Xaa 








180 










185 










190 






Lys 


Xaa 


Lys 


Xaa 


Tyr 


He 


Xaa 


Xaa 


Leu 


Lys 


Xaa 


Xaa 


His 


Leu 


Xaa 


Val 






195 










200 










205 








Phe 


Asn 


Gin 


lie 


Tyr 


Glu 


lie 


Pro 


Thr 


Xaa 


Thr 


Phe 


Leu 


Leu 


Glu 


Lys 




210 










215 










220 








Xaa 


Asp 


Leu 


Lys 


Leu 


Pro 


Leu 


Arg 


Asp 


Arg 


Lys 


Lys 


Arg 


Ser 


Gin 


Arg 


225 










230 










235 










240 


Arg 


Leu 


Arg 


Ala 


Cys 


Ala 


Cys 


Ala 


Phe 


Gly 


Gin 


Ala 











245 250 



(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 
(B> LOCATION 1. . .138 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 17 



Met 


Ala 


Leu 


Asp 


Lys 


Arg 


He 


Trp 


Met 


His 


Phe 


Asp 


Leu 


Leu 


Pro 


Phe 


1 








5 










10 










15 




Val 


Phe 


He 


He 


Pro 


Leu 


Leu 


Val 


Val 


Ser 


Phe 


Leu 


Leu 


He 


Phe 


Glu 








20 










25 










30 






Ser 


Ser 


Ala 


Val 


Leu 


Ser 


Leu 


Lys 


Gin 


Gly 


Val 


Tyr 


Tyr 


Ala 


He 


Gly 






35 










40 










45 






Phe 


Leu 


Leu 


Phe 


Trp 


Val 


Val 


Phe 


Phe 


He 


Pro 


Phe 


Arg 


Lys 


Leu 


Asp 




50 










55 










60 








Arg 


Trp 


Leu 


Phe 


Ala 


Leu 


Tyr 


Trp 


Ala 


Cys 


Val 


lie 


Leu 


Leu 


Ala 


Leu 


65 










70 










75 










80 


Val 


Asp 


Phe 


Met 


Gly 


Ser 


Ser 


Lys 


Leu 


Gly 


Ala 


Gin 


Arg Trp 


Leu 


Val 










85 










90 










95 




He 


Pro 


Phe 


Thr 


Ser 


lie 


Thr 


Leu 


Gin 


Pro 


Ser 


Glu 


Pro 


Val 


Lys 


Asn 








100 










105 










110 




Arg 


Xaa 


Ser 


Phe 


He 


Val 


Gly 


Ala 


Phe 


Xaa 


Xaa 


Asn 


Xaa 


Pro 


Asp 


Xaa 






115 










120 










125 






Leu 


Leu 


Arg 


Ala 


Met 


He 


Gly 


Ala 


Cys 


Phe 















130 135 



(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 26 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
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(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .32 6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:218 



Val 


Leu 


Met 


Ala 


Leu 


Xaa 


Asp 


Lys 


Arg 


Tyr 


Gly 


Leu 


Glu 


Ala 


Gly 


He 


1 








5 










10 










15 




Lvs 


Tyr 


Phe 


Thr 


Met 


Gly 


Ala 


Met 


Ala 


Ser 


Ala 


Phe 


Phe 


Ala 


Met 


Gly 




20 










25 










30 






Ala 


Met 


Ala 
35 


Phe 


Tyr 


Leu 


Leu 


Thr 
40 


Gly 


Ser 


Leu 


Asn 


Leu 
45 


Glu 


Val 


He 


Thr 


Leu 


Tyr 


Leu 


His 


Thr 


Glu 


Gly 


He 


Thr 


Asn 


Pro 


Met 


Leu 


Phe 


Ala 




50 








55 










60 










Met 


Gly 


Thr 


He 


Phe 


Leu 


He 


Gly 


Ala 


He 


Gly 


Phe 


Lys 


Val 


Ser 


Leu 


65 








70 










75 










80 


Val 


Pro 


Phe 


His 


Thr 


Trp 


Met 


Pro 


Asp 


Val 


Tyr 


Glu 


Gly Asn Asn 


Pro 










5 5 










90 














Val 


Phe 


Ala 


Ser 


Tvr 


He 


Ser 


He 


Val 


Pro 


Lys 


He 


Ala Gly 


rue 


Val 








100 










105 










110 






Val 


Ala 


Thr 
115 


Arcr 


Leu 


Phe 


Gly 


Ala 
120 


Phe 


He 


Asp 


Thr 


His 
125 


Thr 


Ala 


Trp 


Val 


Glu 


Asp 


He 


Phe 


Tyr 


Val 


Leu 


He 


Leu 


Met 


Thr 


He 


Thr 


He 


Pro 




13 0 








135 










140 










Asn 


Phe 


lie 


Ala 


Leu 


Trp 


Gin 


Glu 


Asp 


Val 


Lys 


Arg 


Met 


Leu 


Ala 


Tyr 


145 










150 










155 










160 


Ser 


Ser 


lie 


Ser 


His 
165 


Ser 


Glv 


Phe 


Ala 


Leu 
170 


Ala 


Cvs 


Val 


Phe 


He 
175 


His 


Thr 


Glu 


Asp 


Ser 


Gin 


Gin 


Ala 


Met 


Phe 


Val 


Tvr 


Trp 


Phe 


Met 


Phe 


Ala 






180 










185 










190 






Phe 


Thr 


Tyr 
195 


He 


Gly 


Ala 


Phe 


Gly 
200 


Leu 


Leu 


Trp 


Leu 


Leu 
205 


Lys 


Ser 


Arg 


Glu 


Lys 
210 


Thr 


Trp 


Asp 


Glu 


Arg 
215 


Tyr 


Asp 


His 


Pro 


Tyr 
220 


Ser 


Lys 


Phe 


Asn 


Gly 


Leu 


He 


Lys 


Thr 


His 


Pro 


Leu 


Val 


Ala 


He 


Leu 


Gly 


Ala 


He 


Phe 


225 










230 










235 










240 


Val 


Phe 


Gly 


Leu 


Ala 


Gly 


He 


Pro 


Pro 


Phe 


Ser 


Val 


Phe 


Trp 


Gly 


Lys 








245 










250 










255 




Phe 


Leu 


Ala 


Val 
260 


Glu 


Ser 


Ala 


Leu 


Glu 
265 


Ser 


Asn 


His 


He 


Leu 
270 


Leu 


Ala 


Val 


Val 


Met 
275 


Leu 


Val 


Asn 


Ser 


Ala 
280 


Val 


Ala 


Ala 


Phe 


Tyr 
285 


Tyr 


Phe 


Arg 


Trp 


Leu 


Val 


Ala 


Met 


Phe 


Phe 


Asn 


Lys 


Pro 


Leu 


Gin 


Thr 


Gin 


Ser 


Tyr 


290 










295 










300 










Ala 


Lys 


Thr 


He 


Phe 


Thr 


Pro 


Lys 


Thr 


Pro 


Pro 


Cys 


Pro 


Phe 


Met 


Arg 


305 








310 










315 










320 


Ser 


Leu 


Leu 


Pro 


Trp 


Arg 























325 



(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 

(B) TYPE: amino acid 
(Di TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .24 0 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:219 
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WO 97/19098 



PCT/US96/18542 



194 





lie 


Asn 


Jser 


Lys 


Lys 


Ser 


Leu 


Lys 


Lys 


Gly 


Leu 


Arg 


Gly Phe 


Phe 


i 
1 








c 
D 










10 






15 




Lys 


lie 


Leu 


Lys 


Asp 


Arg 


Asn 


Gly 


Ala 


His 


Pne 


Ser 


Cys 


Gly Ala 


Thr 


















2b 










30 




ser 




rue 


pi,, 
uiy 


Leu 


Glu 


I ie 


Ala 


Lys 


Ala 


Phe 


Leu 


Gin 


Lys Asn 


His 






J b 










40 










45 




val 


va 1 


Phe 


Gly 


Thr 


Gly 


Arg 


Arg 


Gin 


Glu 


Asn 


Leu 


Gin 


Lys Leu 


Gin 




ay 










55 










60 






Leu 




Tyr 


Pro 


Lys 


Arg 


Phe 


I le 


Pro 


Leu 


Cys 


Phe 


Asp 


Leu Gin 


Asn 












70 










75 






80 


Lys 


Pro 


(jlU 


rnr 


Lys 


Arg 


Ala 


lie 


Glu 


Thr 


lie 


Phe 


Ser 


Met Thr 


Asp 










85 










90 








95 


Arg 


lie 


Asp 


Ala 


Leu 


lie 


Asn 


Asn 


Ala 


Gly 


Leu 


Ala 


Leu Gly Leu 


Asn 








100 










105 










110 




Lys 


Ala 


Tyr 


Glu 


Cys 


Glu 


Leu 


Asp 


Asp 


Trp 


Glu 


Val 


Met 


lie Asp 


Thr 






115 










120 










125 




Asn 


Tl- 
IlS 


Lys 


Gly 


Leu 


Leu 


His 


Leu 


Thr 


Arg 


Leu 


lie 


Leu 


Pro Ser 


Met 




130 










135 










1 4 C 








lie 


Glu 


His 


Asp 


Gin 


Gly 


Thr 


lie 


lie 


Asn 


Leu 


Gly 


Ser 


lie Ala 


Gly 


145 










150 










155 








160 


Thr 


Tyr 


Ala 


Tyr 


Pro 


Gly 


Gly 


Xaa 


Val 


Tyr 


Gly 


Ala 


Ser 


Lys Ala 


Xaa 










165 










170 








175 




Val 


Lys 


Gin 


Xaa 


Ser 


Xaa 


Asn 


Leu 


Arg 


Ala 


Asp 


Val 


Ala Gly Thr 


Asn 








180 










185 










190 




Thr 


Arg 


Gly 


Arg 


Arg 


Trp 


Asn 


Pro 


Gly 


Cys 


Val 


Ala 


Lys 


Pro Lys 


Val 






195 










200 










205 




Ser 


Arg 


Val 


Arg 


Gly 


Lys 


Gly 


Asp 


Lys 


Pro 


Lys 


Pro 


Lys 


Ser Gly 


Tyr 




210 










215 










220 




Glu 


Lys 


His 


Pro 


Leu 


Pro 


Gin 


Thr 


Thr 


Arg 


Gin 


Gly 


Leu 


Thr Ser 


Gly 


225 










230 










235 








240 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 220: 















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 



Ui) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .204 



Ui) SEQUENCE DESCRIPTION : SEQ ID NO: 220 



Val 


Ser 


Gly 


Val 


Val 


Leu 


Ser 


Lys 


Phe 


Asp 


Ser 


Asp 


Ser 


Lys 


Gly Gly 


1 








5 










10 










15 




lie 


Ala 


Leu 


Gly 


lie 


Thr 


Tyr 


Gin 


Leu 


Gly 


Leu 


Pro 


Leu 


Arg 


Phe 


He 








20 










25 










30 






Gly 


Ser 


Gly 


Glu 


Lys 


He 


Pro 


Asp 


Leu 


Asp 


Val 


Phe 


Met 


Pro 


Glu 


Arg 






35 










40 










45 






lie 


Val 


Gly 


Arg 


Leu 


Met 


Gly 


Ala 


Gly 


Asp 


He 


He 


Ser 


Leu 


Ala 


Glu 




50 










55 










60 










Lys 


Thr 


Ala 


Ser 


Val 


Leu 


Asn 


Pro 


Asn 


Glu 


Ala 


Lys 


Asp 


Leu 


Ser 


Lys 


65 










70 










75 










80 


Lys 


Leu 


Lys 


Lys 


Gly 


Gin 


Phe 


Thr 


Phe 


Asn 


Asp 


Phe 


Leu 


Asn 


Gin 


He 










85 










90 










95 




Glu 


Lys 


Val 


Lys 


Lys 


Leu 


Gly 


Ser 


Met 


Ser 


Ser 


Leu 


He 


Ser 


Met 


He 








100 










105 










110 






Pro Gly 


Leu 


Gly 


Asn 


Met 


Ala 


Ser 


Ala 


Leu 


Lys 


Asp 


Thr 


Asp 


Leu 


Glu 






115 










120 










125 








Ser 


Ser 


Leu 


Glu 


Val 


Lys 


Lys 


He 


Lys 


Ala 


Met 


Val 


Asn 


Ser 


Met 


Thr 



130 135 140 
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195 

Lys Lys GIu Arg Glu Asn Pro Glu lie Leu Asn Gly Ser Arg Arg Lys 
145 150 155 160 

Arg lie Ala Leu Gly Xaa Gly Leu Glu Xaa Xaa Glu lie Asn Arg lie 

165 170 175 

lie Lys Arg Phe Asp Gin Ala Ser Lys Met Ala Lys Arg Leu Thr Asn 

180 185 190 

Lys Lys Gly lie Ser Asp Leu Met Asn Leu Xaa Xaa 
195 200 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .92 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221 



Val 


Glu 


Lys 


Ala 


His 


Pro 


Asp 


Val 


Phe 


Asn 


Leu 


Leu 


Leu 


Gin 


Val 


Leu 


1 






5 








10 










15 




Asp 


Glu 


Gly 


His 


Leu 


Thr 


Asp 


Ser 


Lys 


Gly 


Val 


Arg 


Val 


Asp 


Phe 


Lys 






20 










25 










30 






Asn 


Thr 


He 


Leu 


He 


Leu 


Thr 


Ser 


Asn 


Val 


Ala 


Ser 


Gly Ala 


Leu 


Leu 






35 










40 










45 








Glu 


Glu 


Asp 


Leu 


Ser 


Glu 


Ala 


Asp 


Lys 


Gin 


Lys 


Ala 


He 


Lys 


Glu 


Ser 




50 








55 










60 










Leu 


Arg 


Gin 


Phe 


Phe 


Lys 


Pro 


Glu 


Phe 


Leu 


Asn 


Arg 


Leu 


Asp 


Glu 


He 


65 








70 










75 










80 


lie 


Ser 


Phe 


Asn 


Ala 
85 


Leu 


Asp 


Ser 


His 


Ala 
90 


He 


He 











(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .82 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:222 

Leu Val Phe Leu Asp Arg Arg Leu lie Val Met Val Thr Asp Ser Lys 

15 10 15 

Gly Ser Arg Tyr He Asn Val His He Leu Phe Arg Gin He Ser Leu 

20 25 30 

Tyr Ala Leu Leu Ser Val Val Gly Ser Leu Leu Phe Leu Gly Val Ser 

35 40 45 

Leu Leu Val Leu Asn Lys Glu He Lys Asn lie Glu Lys Gin His Ala 
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SO 55 60 

Leu Xaa Thr Lys GIu Phe Glu Lys Lys Arg Glu Thr Asn Glu Xaa Leu 
65 70 75 80 

Ser Xaa 



(2) INFORMATION FOR SEQ ID NO:223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

ivij ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .233 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:223 



Leu Ser Leu Met 
1 

Thr Arg Ser Val 
20 

Gin Leu Gin Ser 
35 

Lys Leu Val Lys 
50 

Thr Val Leu Asn 
65 

lie Lys Tyr Thr 

Leu Thr Leu lie 
100 

Lys Gly Val Lys 
115 

Ala Phe Thr Leu 
130 

Glu Glu Ser Val 
145 

Arg Glu Leu Val 

Thr Leu His Asp 
180 

Gin Glu Gin Gin 
195 

Glu Tro He He 
210 

Pro He Lys Ala 
225 

(2) INFORMATION 



Xaa 


Val 


Leu 


Asn 


5 








Lys 


Tyr 


His 


Gin 


Tyr 


Lys 


Met 


Ala 








40 


Asp 


Lys 


Lys 


Pro 






55 




Thr 


Phe 


Asp 


Tyr 




70 






Pro 


Glu 


Thr 


Trp 


85 








Pro 


Gly 


Ala 


Leu 


He 


Phe 


Tyr 


He 








120 


Lys 


Thr 


Leu 


Lys 






135 




Leu 


Leu 


Lys 


Glu 




150 






Ala 


Lys 


Asp 


Tyr 


165 








Phe 


Asp 


Ala 


He 


Ala 


Lys 


Val 


Leu 








200 


Leu 


Pro 


Asn 


Ser 






215 




Trp 


Gin 


Asn 


Lys 




230 






FOR 


SEQ 


ID NO: 2i 



Ala 


Lys 


Glu 


Cys 




10 






Gin 


Ser 


Ala 


Glu 


25 








Lys 


Met 


Ala 


Leu 


Ala 


Val 


He 


Leu 








60 


Ala 


Gly 


Tyr 


Leu 






75 




Asp 


Lys 


Phe 


Glu 




90 






Asp 


Phe 


Leu 


Glu 


105 








Ser 


Asn 


Arg 


Thr 


Ser 


Phe 


Lys 


Leu 








140 


Lys 


Gly 


Lys 


Pro 






155 




Ala 


He 


Val 


Leu 




170 






Phe 


Ala 


Lys 


Asp 


185 








Gin 


Asn 


Ala 


Gin 


Leu 




Gly 


Thr 








220 


Lys 









Val Xaa Pro He 
15 

lie Arg Ala Leu 
30 

Asp Asn Asn Leu 
45 

Asp Leu Asp Glu 

Val Lys Asn Cys 
80 

Lys Glu Gly Ser 
95 

Tyr Ala Asn Ser 
110 

Gin Lys Asn Lys 
125 

Pro Gin Val Ser 

Lys Ala Val Arg 
160 

Gin Val Gly Asp 
175 

Ala Lys Asn Ser 
190 

Lys Phe Gly Thr 
205 

Trp Glu Asp Gly 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE : 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1 . . . 85 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 



Met 


Leu 


Ala 


Ala 


Gly 


Leu 


Thr 


Leu 


Pro 


Glu 


Phe 


Gly 


Cys 


Tyr 


Leu 


Ser 


1 








5 










10 










15 




His 


Tyr 


Leu 


Leu 


Trp 


Lys 


Glu 


Cys 


Val 


Lys 


Leu 


Asp 


Gin 


Pro 


Val 


Val 








20 










25 










30 






He 


Leu 


Glu 


Asp 


Asp 


Val 


Thr 


Leu 


Glu 


Ser 


His 


Phe 


Met 


Gin 


Ala 


Leu 






35 










40 










45 








Glu 


Asp 


Cys 


Leu 


Lys 


Ser 


Pro 


Phe Asp 


Phe 


Val 


Arg 


Leu 


Tyr 


Gly 


Cys 




50 










55 










60 










Tyr 


Trp 


Tyr 


Tyr 


Gin 


Arg 


Asp 


Lys 


He 


Pro 


Cys 


Phe 


Ala 


Gin 


Arg 


He 


65 










70 










75 










SO 


Cys 


He 


Ser 


Ser 


Leu 

























85 



(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .115 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 5 



Leu 


He 


Ala 


Leu 


Arg 


Val 


Thr 


Ala 


Trp 


Lys 


Val 


Xaa 


Ala 


Met 


Lys 


Arg 


1 ' 








5 










10 










15 




Leu 


His 


Leu 


Ser 


Val 


Lys 


Asp 


Ala 


Glu 


Asn 


Phe 


Asp 


Ala 


He 


Leu 


Arg 








20 










25 










30 






Glu 


Arg 


Pro 


Phe 


Phe 


Lys 


Asp 


Leu 


He 


Glu 


Phe 


Met 


Val 


Ser 


Gly 


Pro 






35 










40 










45 








Val 


Val 


Val 


Met 


Val 


Leu 


Glu 


Gly Lys 


Asp 


Ala 


Val 


Ala 


Lys 


Asn 


Arg 




50 










55 










60 










Glu 


Leu 


Met 


Gly 


Ala 


Thr 


Asp 


Pro 


Lys 


Leu 


Ala 


Gin 


Lys 


Gly 


Thr 


He 


65 










70 










75 










80 


Arg 


Ala 


Asp 


Phe 


Ala 


Glu 


Ser 


He 


Asp 


Ala 


Asn 


Ala 


Val 


His 


Gly 


Ser 










85 










90 










95 




Asp 


Ser 


Leu 


Glu 


Asn 


Ala 


His 


Asn 


Glu 


He 


Ala 


Phe 


Phe 


Phe 


Ala 


Ala 



100 105 110 



Arg Glu Phe 
115 

(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 94 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .394 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 6 



Leu 


Met 


Trp 


Leu 


Lys 


Thr 


Leu 


Thr 


Leu 


Gin 


Thr 


Leu 


Asn 


Thr 


Asp 


Lys 


1 








5 










10 










15 


Ala 


Leu 


Gin 


Glu 


Phe 


Ser 


Lys 


Thr 


Met 


Glu 


Ala 


Phe 


Lys 


Thr 


Lys 


Leu 








20 










25 








30 




lie 


Gin 


Ser 


Ala 


Asn 


Asp 


Val 


His 


Ser 


Glu 


Thr 


Ser 


Arg 


Ala 


Ala 


He 






35 










40 










45 








Ala 


Asn 


Asp 


Leu 


Glu 


Arg 


Leu 


Lys 


Glu 


His 


Met 


He 


Asn 


Val 


Ala 


Asn 




50 










55 










60 










Thr 


Ser 


lie 


Giy 


Giy 


Glu 


Phe 


Leu 


Phe 


Giy 


uly 


Ser 


Lys 


Val 


Asp 


Arg 


65 










70 










75 










80 


Pro 


Pro 


lie 


Asp 


Ser 


Asn 


Gly 


Lys 


Tyr 


His 


Gly 


Asn 


Gly Glu Asp 


Leu 










85 










90 










95 




Asn 


Ala 


Leu 


He 


Ser 


Ser 


Asp 


Asn 


Leu 


Val 


Pro 


Tyr 


Asn 


He 


Ser 


Gly 








100 










105 








110 






Gin 


Asp 


Leu 


Phe 


Leu 


Gly 


Thr 


Asp 


Lys 


Asp 


Lys 


His 


Lys 


Leu 


He 


Thr 






115 










120 










125 








Thr 


Asn 


He 


Lys 


Leu 


Leu 


Asn 


Gin 


Asn 


Lys 


Leu 


Xaa 


Pro 


Asp 


Val 


Met 




130 










135 










140 










Asp 


Ala 


Leu 


Glu 


His 


Ser 


Ser 


Leu 


Pro 


Glu 


Glu 


Val 


Phe 


He 


Lys 


Pro 


145 










150 










155 










160 


Ser 


Asp 


Thr 


Leu 


Arg 


Glu 


Leu 


He 


Gly 


Asp 


Asn 


Asp 


Lys 


Asn 


Pro 


Thr 










165 










170 










175 




Asn 


Asp 


Pro 


Lys 


Glu 


Phe 


Phe 


Tyr 


Leu 


Gin 


Gly 


He 


Arg 


Pro 


Asp 


Gly 








180 










185 










190 






Ser 


Ser 


Phe 


Lys 


Glu 


Lys 


Phe 


Ala 


Leu 


Asp 


Lys 


Ala 


Tyr 


Gin 


Asn 


Gin 






195 










200 










205 








Glu 


Ser 


Ala 


Thr 


Lys 


Val 


Ser 


Asp 


Leu 


Leu 


Asp 


Lys 


He Gly His 


Ala 




210 










215 










220 










Tyr 


Gly Asn 


Thr 


Ser 


Gin 


Asn 


Lys 


Val 


Val 


Asp 


Val 


Ser 


Leu 


Asn 


Asn 


225 










230 










235 










240 


Trp Gly Gin 


He 


Glu 


He 


Lys 


Asn 


Leu 


Thr 


Pro 


Gly 


Ser 


Glu 


Asn 


Leu 










245 










250 








255 




Asp 


Phe 


His 


Leu 


lie 


Ser 


Ser 


Asp 


Gly 


Asp 


Phe 


Asp 


Asp 


Leu 


Asp 


Ala 








260 










265 










270 






Leu 


Arg 


Ser 


Ser 


Gly 


Lys 


Arg 


Val 


Thr 


Glu 


Tyr 


Val 


Lys 


Ser 


Ala 


Phe 






275 










280 










285 








Val 


Thr 


Asp 


Arg 


Ser 


Leu 


Ser 


Gin 


Val 


Lys 


Ala 


Val 


Pro 


Asn 


Met 


Tyr 




290 










295 








300 










Asn 


Pro 


Lys 


Val 


Leu 


Glu 


He 


Pro 


Ser 


Val 


Phe 


Val 


Thr 


Lys 


Asp 


Asn 


305 










310 










315 








320 


Val 


Leu 


Ala 


Asn 


Lys 


Asn 


Thr 


Lys 


Leu 


Ser 


Glu 


He 


Phe 


Gly Asp 


Lys 










325 










330 










335 




Val 


Glu 


Thr 


Leu 


Lys 


He 


Asn 


Ala 


Ser 


Arg 


Leu 


Gly 


Asp 


Glu 


Ser 


Ala 








340 










345 










350 






He 


Lys 


He 


Pro 


Asn 


Leu 


Pro 


He 


Asn 


Leu 


Asp 


He 


Pro 


He 


Leu 


Leu 






355 










360 








365 








Asp 


Val 


Lys 


Asn 


Ser 


Thr 


He 


Lys 


Asp 


Leu 


Lys 


Asp 


Ala 


He 


Lys 


Glu 




370 










375 










380 










Arg 


Phe 


Asn 


Asn 


Glu 


Gly 


Gly 


Cys 


Gly 


Asn 















385 390 



(2) INFORMATION FOR SEQ ID NO:227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .102 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227 



Leu 


Lys 


Ala 


Leu 


Asn 


Asp 


Cys 


Met 


Val 


Phe 


Phe 


His 


Lys 


Lys 


He 


He 


1 ' 








5 










10 










15 




Leu 


Asn 


Phe 


He 
20 


Tyr 


Ser 


Leu 


Met 


Val 
25 


Ala 


Phe 


Leu 


Phe 


His 
30 


Leu 


Ser 


Tyr 


Gly 


Val 
35 


Leu 


Leu 


Lys 


Ala 


Asp 
40 


Gly 


Met 


Ala 


Lys 


Lys 
45 


Gin 


Thr 


Leu 


Leu 


Val 
50 


Gly 


Glu 


Arg 


Leu 


Vai 
55 


Trp 


Asp 


Lys 


beu 


'rhr 
60 


Leu 


Leu 


Giy 


Phe 


Leu 


Glu 


Lys 


Asn 


His 


He 


Pro 


Gin 


Lys 


Leu 


Tyr 


Tyr 


Asn 


Leu 


Ser 


Ser 


65 










70 










75 










80 


Gin 


Asp 


Lys 


Glu 


Leu 
85 


Ser 


Ala 


Glu 


He 


Gin 
90 


Ser 


Asn 


Val 


Thr 


Tyr 
95 


Tyr 


Xaa 


Phe 


Lys 


Arg 


Cys 


Lys 























100 



(2) INFORMATION FOR SEQ ID NO: 22 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 363 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 
<B) LOCATION 1. . .363 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228 



Met 


Lys 


Phe 


Phe 


Leu 


Leu 


Lys 


Lys 


Phe 


Ser 


Xaa 


Phe 


Leu 


Asn 


Thr 


Gin 


1 








5 










10 










15 




Thr 


His 


Phe 


Asn 


Leu 


Lys 


Arg 


Leu 


Asn 


Ala 


Ser 


Ser 


Phe 


Leu 


Leu 


Glu 








20 










25 










30 






Thr 


Phe 


Ser 


Lys 


Glu 


Lys 


His 


Ala 


Phe 


Val 


Val 


Asp 


Leu 


Ser 


Ala 


Pro 






35 










40 










45 








Tyr 


He 


Gly 


Leu 


Ser 


Lys 


Lys 


Pro 


Pro 


Glu 


Ser 


Val 


Leu 


Lys 


Asn 


Thr 




50 










55 










60 










Leu 


Ala 


Leu 


Asp 


Phe 


Cys 


Leu 


Asn 


Lys 


Phe 


Thr 


Lys 


Asn 


Ala 


Lys 


He 


65 










70 










75 










80 


Leu 


Gin 


Ala 


Asn 


Val 


He 


Asp 


Asn 


Asp 


Arg 


He 


Leu 


Glu 


He 


Lys 


Gly 










85 










90 










95 




Ala 


Lys 


Asp 


Leu 


Ala 


Tyr 


Lys 


Ser 


Glu 


Thr 


Phe 


He 


Leu 


Arg 


Leu 


Glu 








100 










105 










110 






Met 


lie 


Pro 


Lys 


Lys 


Ala 


Asn 


Leu 


Met 


lie 


Leu 


Asp 


Gin 


Glu 


Lys 


Cys 






115 










120 










125 








Val 


He 


Glu 


Ala 


Phe 


Arg 


Phe 


Asn 


Asp 


Arg 


Val 


Ala 


Lys 


Asn 


Asp 


He 




130 










135 










140 










Leu 


Gly 


Ala 


Leu 


Pro 


Pro 


Asn 


He 


Tyr 


Glu 


His 


Gin 


Glu 


Glu 


Asp 


Leu 


145 










150 










155 










160 


Asp 


Phe 


Lys 


Gly 


Leu 


Leu 


Asp 


He 


Leu 


Glu 


Lys 


Asp 


Phe 


Leu 


Ser 


Tyr 










165 










170 










175 




Gin 


His 


Lys 


Glu 


Leu 


Glu 


His 


Lys 


Lys 


Asn 


Gin 


He 


He 


Lys 


Arg 


Leu 
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180 185 190 



Asn 


Ala 


Gin 


Lys 


Glu 


Arg 


Leu 


Lys 


Glu 


Lys 


Leu 


Glu 


Lys 


Leu 


Glu 


Asp 






195 










200 










205 






Pro 


Lys 
210 


Thr 


Leu 


Gin 


Leu 


Glu 
215 


Ala 


Lys 


Glu 


Leu 


Gin 
220 


Thr 


Gin 


Ala 


Ser 


Leu 


Leu 


Leu 


Thr 


Tyr 


Gin 


His 


Leu 


He 


Asn 


Arg 


Arg 


Glu 


Asn 


Arg 


Val 


225 










230 










235 








240 


He 


Leu 


Lys 


Asp 


Phe 


Glu 


Asp 


Lys 


Glu 


Cys 


Met 


He 


Glu 


He 


Asp 


Lys 










245 










250 










255 


Ser 


Met 


Pro 


Leu 


Asn 


Ala 


Phe 


He 


Asn 


Lys 


Lys 


Phe 


Thr 


Leu 


Ser 


Lys 








260 










265 










270 




Lys 


Lys 


Lys 


Gin 


Lys 


Ser 


Gin 


Phe 


Leu 


Tyr 


Leu 


Glu 


Glu 


Glu 


Asn 


Leu 






275 










280 








285 








Lys 


Glu 


Lys 


lie 


Ala 


Phe 


Lys 


Glu 


Asn 


Gin 


He 


Asn 


Tyr 


Val 


Arg 


Asp 




290 










295 










300 






Ala 


Ala 


Glu 


Glu 


Ser 


Val 


Leu 


Glu 


Met 


Phe 


Met 


Pro 


Val 


Lys 


Asn 


Ser 


305 










310 










315 








320 


Lys 


lie 


Lys 


Arg 


Pro 


Met 


Asn 


Giy 


Tyr 


Glu 


Vet I 


Leu 


Tyt 


7yi 


Ly a 


Asp 










325 










330 










335 


Xaa 


Lys 


Xaa 


Gly 


Leu 


Gly 


Lys 


Thr 


Lys 


Lys 


Arg 


He 


Ser 


Ser 


Phe 


Tyr 








340 










345 










350 




Lys 


Thr 


Gin 


Xaa 


Arg 


Met 


He 


Xaa 


Gly 


Cys 


Xaa 













355 360 
(2) INFORMATION FOR SEQ ID NO: 22 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

ivi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229 
ATATCCATGG TGAGTTTGAT GA 
(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_£eature 

(B) LOCATION 1. . .25 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230 
ATGAATTCAA TTTTTTATTT TGCCA 
(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

{iii) HYPOTHETICAL: NO 

(XV) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .21 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231 
AATTCCATGG TGGGGGCTAT G 
(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232 
ATGAATTCTC GATAGCCAAA ATC 
(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .24 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO:233 
ATTTCCATGG TCATGTCTCA TATT 
(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) YuPGLOGx: circular 

(ii) MOLECULE TYPE: DNA {genomic) 

(iii) HYPOTHETICAL: NO 

<iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: mis cofeature 

(B) LOCATION 1. . .23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:234 
ATGAATTCCA TCTTTTATTC CAC 
(2) INFORMATION FOR SEQ ID NO: 23 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:235 
AACCATGGTG ATTTTAAGCA TTGAAAG 
(2) INFORMATION FOR SEQ ID NO: 23 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 
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(ii) MOLECULE TYPE: DNA (genomic) 

{iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION 1. . .28 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:236 
AAGAATTCCA CTCAAAATTT TTTAACAG 
( 2 ) INFORMATION FOR SEQ ID NO; 23 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1 . . .25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237 
GATCATCCAT ATGTTATCTT CTAAT 
(2) INFORMATION FOR SEQ ID NO: 23 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238 
TGAATTCAAC CATTTTAACC CTG 
(2) INFORMATION FOR SEQ ID NO: 23 9: 



SUBSTITUTE SHEET (RULE 26) 



WO 97/19098 



204 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
.(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

( B) LOCATION i . . . 21 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 9 
TATACCATGG TGAAATTTTT TCTTTTA 
(2) INFORMATION FOR SEQ ID NO:240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION 1. . .25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:240 
AGAATTCAAT TGCGTCTTGT AAAAG 
(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1 . . .25 
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<xi) SEQUENCE DESCRIPTION SEQ ZD NO: 241 
TTATGGATCC AAACCAATTA AAACT 
(2) INFORMATION FOR SEQ ID NO:242: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc^feature 

(B) LOCATION 1. . .23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:242 
TATCTCGAGT TATAGAGAAG GGC 
(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 
ATATCCATGG TGAGTTTGAT GA 
(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 
ATGAATTCAA TTTTTTATTT TGCCA 
(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iiij HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 
(Bi LOCATION 1. . .23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:245 

AATTCCATGG CTATCCAAAT CCG 

(2) INFORMATION FOR SEQ ID NO: 24 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246 
ATGAATTCGC CAAAATCGTA GTATT 
(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247 
GATACCATGG AATTTATGAA AAAG 
(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248 
TGAATTCGAA AAAGTGTAGT TATAC 
(2) INFORMATION FOR SEQ ID NO: 24 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249 
TTGAACACTT TTGATTATGC GG 
(2) INFORMATION FOR SEQ ID NO: 250: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE : DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1. . .23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:250 
GGATTATGCG ATTGTTTTAC AAG 
(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : xnisc„£ eature 

(B) LOCATION 1 . . .21 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 51 
GTCTTTAGCA AAAATGGCGT C 
(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1 . . .21 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252 
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AATGAGCGTA AGAGAGCCTT C 2 
(2) INFORMATION FOR SEQ ID NO: 253; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{ D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii} HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

( vi ) ORIGINAL SOURCE; 

(A) ORGANISM: Helicobacter pylori 

<ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253 
CTTATGGGGG TATTGTCA 1 
(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix| FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254 



AGCATGTGGG TATCCAGC 
(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



1 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

{iv} ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
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(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .19 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 
AGGTTGTTGC CTAAAGACT 
(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE; DMA ; genomic] 

(iii) HYPOTHETICAL: NO 

(ivi ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_£ eature 

(B) LOCATION 1. . .18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256 
CTGCCTCCAC CTTTGATC 

(2) INFORMATION FOR SEQ ID NO: 257 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .19 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2 57 
ACCAATATCA ATTGGCACT 
(2) INFORMATION FOR SEQ ID NO: 25 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iiii HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1 ... 18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:258 
ACTTGGAAAA GCTCTGCA 
(2) INFORMATION FOR SEQ ID NO: 259: 

( i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

fii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .19 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259 

CTTGCTTGTC ATATCTAGC 

(2) INFORMATION . FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260 
GTTGAAGTGT TGGTGCTA 

(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1 . . .22 

Ui) SEQUENCE DESCRIPTION: SCQ ID NO : 2 £ 1 

CAAGCAAGTG GTTTGGTTTT AG 22 

(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262 
TGGAAAGAGC AAATCATTGA AG 22 
(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION 1 ... 21 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263 
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GCCCATAATC AAAAAGCCCA T 
(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE : DNA < genomic) 

(iii> HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE : 

(A) NAME /KEY : misc_f eature 
(BJ LOCATION 1. . .24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 64 

CTAAAACCAA ACCACTTGCT TGTC 

(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1. . .16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:265 
GTAAAACGAC GGCCAG 

(2) INFORMATION FOR SEQ ID NO:266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) H YPOTHETI CAL : NO 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
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(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266 
CAGGAAACAG CTATGAC 

(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

{ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .21 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267 
ATCTTACCTA TC AC CTC AAA T 
(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_£ eature 

(B) LOCATION 1. . .21 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 68 
AGACAGCAAC ATCTTTGTGA A 
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5 CLAIMS 

1. A substantially pure nucleic acid encoding an H. pylori polypeptide, said nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID 
NOs:l-114. 

2. A substantially pure nucleic acid from naturally occurring H. pylori which 
hybridizes under stringent conditions to a nucleic acid which encodes an K pylori 
polypeptide, said nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO:M 14. 

3. A method of evaluating a compound for the ability to bind an H. pylori nucleic 
acid comprising: contacting said compound with an H. pylori nucleic acid selected from 
the group consisting of SEQ ID NO: 1-1 14 and determining if said compound binds said K 
pylori nucleic acid. 

4. The method of claim 3, wherein said compound is an activator of the bacterial 
life cycle. 

5. The method of claim 3, wherein said compound is an inhibitor of the bacterial 
25 life cycle. 

6. The method of claim 3, wherein said method is performed in vitro. 

7. The method of claim 3, wherein said method is performed in vivo. 

8. A method of generating a vaccine for immunizing a subject against H. pylori 
infection comprising: immunizing said subject with a nucleic acid encoding an H. pylori 
polypeptide or a fragment thereof, said nucleic acid comprising a nucleotide sequence 
selected from the group consisting of SEQ ID NO: 1-1 14, and a therapeutically acceptable 
carrier. 

9. A method of detecting the presence of a Helicobacter species in a sample 
comprising: 

contacting said sample with a nucleic acid encoding an H. pylori polypeptide, said 
40 nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ 
ID NO:M14; 

hybridizing said sample to said nucleic acid; 
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5 said hybridization being indicative of the presence of said Helicobacter species in said 
sample. 

10. The method of claim 9, wherein said Helicobacter species is H pylori. 

10 11- The method of claim 9, wherein said nucleic acid is 20 or more nucleotides in 

length. 

12. A method of inhibiting expression of a gene from a Helicobacter species 
comprising: administering to said species an H pylori antisense nucleic acid selected from 

15 the group consisting ofSEQ ID NO: 1-1 14. 

13. The method of claim 12, wherein said Helicobacter species is H pylori, 

14. The method of claim 12, wherein said antisense nucleic acid is administered in 
20 a carrier. 

15. The method of claim 12, wherein said carrier is a liposome or a bacteriophage. 

16. The method of claim 12, wherein said antisense nucleic acid is 20 or more 
25 nucleotides in length. 

17. The method of claim 12, wherein said antisense nucleic acid is capable of 
binding to Helicobacter nucleic acid or mRNA. 

30 18. A method of generating a vaccine for immunizing a subject against H pylori 

infection comprising: immunizing said subject with a nucleic acid encoding an H. pylori 
polypeptide or a fragment thereof, said polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO:l 15-228, and a therapeutically 
acceptable carrier. 

35 



SUBSTITUTE SHEET (RULE 26) 



WO 97/19098 



1/15 



PCT/US96/18542 



o o < 

■h a o 

iJ H « 

r *; o 

* -4 

H • 1* 

o c « 

J3 U O 



o a • 

a « ** 

a 14 o 

-4 O M 

C 0. 



u ^ o. c e e 

>*, V X «4 



3 

N 
r- 

5 



It 

ri _ 

o ft- 

o (V 

2 J 



HI 



15 

rv 

a. 
S 




3 O 
U 55 



* * I s h n te t» .r»w * a> r» tf» r» -r* « » 

o n >««to r- . * •aonwmniOhbaoo •or* 

S « 2 t 2 W 10 IOf»8l/INrtOOlwrilfllft(Dft<l>(OciO^ 

£2^! 2 ttm ® -■-<Hf»tr-«ctf»«-»ot»nt©mt©inocM«"t©« 
[•♦mOnomo^esooDiBeDUB^oOMOioiHfliON 



• or* *n o rtiAN » f» w * - • •» ottsr* • 

tgninnoHninAinhOSwoisinMnrtOHh 
rttArif k ir>woor*'OriricvCD0io«ot<nH*>f^mr i i 



WO 97/19098 



2/15 



PCMJS96/I8542 



r 



e 

£3 



s 



h -H iJ 



H 

IE 
4 a 



o « 
a2 



to 

4 



4 $S 

Or ^ M 




H 
Z 

o 

u 

tJ 




£ £ £ £ E £ E £ £ S S S S 3 £ £ £ S S £ £ £ 2 2 E £ E 2 £ g 2 2 2 £ £ 2 £ 2 2 2 2 2 2 i £ «>V » « o o 2 3 



WO 97/19098 3/15 PCT/US96/18542 



rn 





2S5s"Ss"5SS2H"SS52SS3S3555^^ 



WO 97/19098 



4/15 



PCT/US96/18542 




WO 97/19098 



5/15 



PCT/US96/18542 




WO 97/19098 



6/15 



PCT/US96/18542 



<0 -» 



* -4 9 

'8 a 



o u c 

U O V* 



o o 

DC U & 



0- 



J 



> E £ 
S3 



r* c* co 
t-t *■» t-* 

V> O V 



3 2 

H M 



£2 

§1 
is 



5 2 



i tip 

o o a 



I. 



.1 



Q 

H 
Z 

o 
u 

es 
=> 



S3 



• I- ft O -I * 



• « -r-o n w» » r» r- * cm 
to««*u>f»tin«*o<*f-fto»<-tu>c»r-( 



f H N M . 
I" « H Iflh 
OSN 



WO 97/19098 



7/15 



PCT/US96/18542 




WO 97/19098 



8/15 



PCT/US96/18542 



oe 



Q 1? 
•h a. 



•j 

** < 

i a 
u o 



I 

jfc. 



3 

3 



?! 



3 »-» 

2! 



'3 

Ou 



u 



r 

o 

(V 



O 

u 

w 
OS 




)»OHnn«mwr0«oH(. . . 

)aAAAAft«t«)A«AOOOOOO( 



WO 97/19098 



9/15 



PCT7US96/18542 




WO 97/19098 



10/15 



PCT/US96/18542 




!ffllli«l!lIilI«fllllIli!ilMI 



WO 97/19098 



11/15 



PCT/US96/18542 



! 



1; 



ti *£ 

i i ¥ I 

III! 

¥ I \ I 



1 iii 



i. 



h ! i 



h 

Jl 



Jl 



Z 

o 

u 

<3 



lliflllill 



3SS'3S33SSS3SSSSSiS^SSSS98 



WO 97/19098 



12/15 



PCT/US96/18542 




WO 97/19098 



13/15 



PCIYUS96/18542 




WO 97/19098 



15/15 



PCT/US96/18542 



5 J 



5 

N 



L\\\\\\\\\\\\\^ 



<0 



CM 



5 



5 



SO 



in 
Ed 

es 



cfu/25 mmz MUCOSA 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCTAJS96/18542 



A. CLASSIFICATION OF SUBJECT MATTER 
IPC(6) :C07H 21/04 
US CL :536/23.7 

According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 536/23.7 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 

APS, MEDLINE. BIOSIS, CA, DERWENT 
search terms: H. pylori, vaccine, gene, protein 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



A 
A 

A, P 



US 5,403,924 A (COVER et al.) 04 April 1995. 
US 5,434,253 A (THOMPSON et al.) 18 July 1995. 
US 5,527,678 A (BLASER et al.) 18 June 1996. 



1-7 
1-7 
1-7 



| | Further documents are listed in the continuation of Box C. See patent family annex. 



"A* 

•F 

m V 

'0* 

•r 



Special catef orie» of cited i Ii w ibmm i i: 

documeatdefuuof the fCDcrml Me of the art which m not considered 
to be of partocular relevaace 

carber document pubhabed oo or after the mut a ti onal filing date 

docwDcnt wbicfe may throw doubo oa priority chinu» or which ■ 
cited to establiab be publication date of toother citaboa or other 
i*on<» specified) 

ii fill jag to aa oral disclosure, inc. eibibibon or other 



kuer docunsentpuMafaed after (be atemational fUinf date or priority 
dale aod oot n conflict witb ibe application but chad to undentsnd the 
prime or Cheery tadertyint the amotion 



•r 



rt ii nan ft of pmrbcukr relevance; die classed mvesboo cannot be 
considered novel or cannot be considered to involve aa inventive atop 
when the <*~~"~*~" it taken alone 

document of particular relevance: the claimed invention cannot be 
considered to nvoWe an Bvontfvc etep when the document ii 



betas obvioui lo a peraoa skilled in the art 
di n mum i n aKnobcr of the nunc pateoi famiJy 



Date of the actual completion of the international search 
13 MARCH 1997 



Date of mailing of the international search report 




Name and mailing address of the ISA/US 
Commissioner of PiUots and Trademarks 

Box per 

Washington, D.C. 20231 
Facsimile No . (703) 305-3230 



Autho; 



Telephone No. (703) 308-0196 



Form PCT/ISA/210 (second sheetKJuly 1^2)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US96/18542 



Box 1 Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. I I Claims Nos.: 

' — ' because ihcy relate to subject matter not required to be searched by this Authority, namely: 



2. I I Claims Nos.: 

' — ' because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3. [_J Claims Nos.: 

because they arc dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Obcerrations where unity of intention a lacking (Continuation of kern 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 



1. I I As all required additional search fee* were timely paid by the applicant, this international search report covers all searchable 

claims. 

2. As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

3. rn As only some of the required additional search fees were timely paid by the applicant, this international search report covers 

only those claims for which fees were paid, specifically claims Nos.; 



4, nj] No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
— restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 
1.7 regarding SEQ ID NO: 9, 46, 59, 69, 83, 97, 98, 101 , 109, and 1 14 



Remark on Protest Q The additional search fees were accompanied by the applicant's protest. 

[ | No protest accompanied the payment of additional search fees. 



Form PCT/1SA/210 (continuation of first sheet(l)XJuly 1992)* 



/ 



THIS PAGE BLANK 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 



kTFADED TEXT OR DRAWING 

□ BLURRED OR ILLEGD3LE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

Q'LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: : 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




THIS PAGE BLANK (usfto) 



